A Very Large Database of Collocations and Semantic Links
Igor Bolshakov6
and Alexander Gelbukh6 
| (6) |
Center for Computing Research, National Polytechnic Institute, Av. Juan Dios Bátiz s/n esq. Mendizabal col. Zacatenco, 07738 México DF., Mexico |
Abstract
A computational system manages a very large database of colloca- tions (word combinations) and semantic links. The collocations
are related (in the meaning of a dependency grammar) word pairs, joint immediately or through prepositions. Synonyms, antonyms,
subclasses, superclasses, etc. repre- sent semantic relations and form a thesaurus. The structure of the system is uni- versal,
so that its language-dependent parts are easily adjustable to any specific language (English, Spanish, Russian, etc.). Inference
rules for prediction of highly probable new collocations automatically enrich the database at runtime. The inference is assisted
by the available thesaurus links. The aim of the system is word processing, foreign language learning, parse filtering, and
lexical dis- ambiguation.
Keywords dictionary - collocations - thesaurus - syntactic relations - semantic relations - lexical disambiguation.
References secured to subscribers.