Seminář: Frequent patterns for Natural Language Processing

Datum a čas 11. 5. 2006 10:30 - 12:00
Místnost 403 NB

Frequent patterns for Natural Language Processing

Prezentující: Luboš Popelínský

Frequent patterns mining is one of the most important tasks in descriptive data mining. Frequent patterns has also been successfully used for data preprocessing and classification. The content of the presentation is as follows. Firstly, we define frequent patterns both in propositional and first order logic and mention algorithms for mining them. We also define emerging and jumping emerging patterns. Then we briefly describe RAP, a system for mining long first–order frequent patterns in multi–relation data. After we depict tRAPe, a general framework for frequent patterns mining in text. Two methods of using patterns will be described: feature construction (propositionalization) and classification based on associations (CBA). We describe experiments with tRAPe for information extraction from biological texts, context–sensitive text correction for English and morphological disambiguation of Czech. Related resources: * http://www.fi.muni.cz/kd/projects/rap/ * http://www.fi.muni.cz/kd