Seminář: Strigil: A Framework for Data Extraction
Datum a čas | 28. 3. 2013 10:30 - 12:00 |
---|---|
Místnost | 336 RB |
Strigil: A Framework for Data Extraction
Prezentující: Jakub Stárka
The talk describes Strigil, a system for script-driven data retrieval from textual or weak-structured documents. In particular we identify the most common problems and we discuss some of the possible solutions and workarounds. Additionally the talk involves a description of our proposed scripting language, which allows to define the way how the data should be extracted.