Research Papers
Dictionary-Free Morphological Classifier of Russian Nouns
Igor A. Bolshakov1
and Elena I. Bolshakova2 
| (1) |
Center for Computing Research (CIC), National Polytechnic Institute (IPN), Mexico City, Mexico |
| (2) |
Moscow State Lomonosov University, Faculty of Computational Mathematics and Cybernetics, Moscow, Russia |
Abstract
A dictionary-free morphological classifier of nouns for a highly inflective language is developed. The classifier is a front-end
utility for acquiring a very large DB of Russian collocations and WordNet-like semantic links. For its main functions, the
classifier uses the final letters of standard noun forms and extensive morphological and lexical data. The percentage of nouns
correctly classified in a standalone manner is now 99.65%. A completely error-free performance is impossible for context-free
methods in principle, primarily because of homonymy: the nouns of various senses may decline in different ways. Therefore
the classifier’s results are additionally tested against more than 200,000 collocations stored in the DB and, when it is necessary,
are automatically corrected.
Work done under partial support of Mexican Government (CONACyT, SNI, SIP-IPN) and Russian Foundation of Fundamental Research
(RFFI, the grant 06-01-00571). Many thanks to Steve Legrand for good advice and proofreading.