Seminar: Strigil: A Framework for Data Extraction

Date and time 28. 3. 2013 10:30 - 12:00
Room 336 RB

Strigil: A Framework for Data Extraction

Speakers: Jakub Stárka

The talk describes Strigil, a system for script-driven data retrieval from textual or weak-structured documents. In particular we identify the most common problems and we discuss some of the possible solutions and workarounds. Additionally the talk involves a description of our proposed scripting language, which allows to define the way how the data should be extracted.