Seminář: Strigil: A Framework for Data Extraction

Datum a čas 28. 3. 2013 10:30 - 12:00
Místnost 336 RB

Strigil: A Framework for Data Extraction

Prezentující: Jakub Stárka

The talk describes Strigil, a system for script-driven data retrieval from textual or weak-structured documents. In particular we identify the most common problems and we discuss some of the possible solutions and workarounds. Additionally the talk involves a description of our proposed scripting language, which allows to define the way how the data should be extracted.