Seminář: Frequent patterns for Natural Language Processing
Datum a čas | 11. 5. 2006 10:30 - 12:00 |
---|---|
Místnost | 403 NB |
Frequent patterns for Natural Language Processing
Prezentující: Luboš Popelínský
Frequent patterns mining is one of the most important tasks in descriptive data mining. Frequent patterns has also been successfully used for data preprocessing and classification. The content of the presentation is as follows. Firstly, we define frequent patterns both in propositional and first order logic and mention algorithms for mining them. We also define emerging and jumping emerging patterns. Then we briefly describe RAP, a system for mining long first–order frequent patterns in multi–relation data. After we depict tRAPe, a general framework for frequent patterns mining in text. Two methods of using patterns will be described: feature construction (propositionalization) and classification based on associations (CBA). We describe experiments with tRAPe for information extraction from biological texts, context–sensitive text correction for English and morphological disambiguation of Czech. Related resources: * http://www.fi.muni.cz/kd/projects/rap/ * http://www.fi.muni.cz/kd