Seminar: Strigil: A Framework for Data Extraction
Date and time: 28. 3. 2013, 10:30 - 12:00
Presenter: Jakub Stárka
The talk describes Strigil, a system for script-driven data retrieval from textual or weakly structured documents. In particular, we identify the most common problems and discuss some possible solutions and workarounds. The talk also describes our proposed scripting language, which lets users define how the data should be extracted.