View Related Documents

Abstract

Manual construction of a wordnet can be facilitated by a system that suggests semantic relations acquired from corpora. Such systems tend to produce many wrong suggestions. We propose a method of filtering a raw list of noun pairs potentially linked by hypernymy, and test it on Polish. The method aims for good recall and sufficient precision. The classifiers work with complex features that give clues on the relation between the nouns. We apply a corpus-based measure of semantic relatedness enhanced with a Rank Weight Function. The evaluation is based on the data in Polish WordNet. The results compare favourably with similar methods applied to English, despite the small size of Polish WordNet.

Keywords  lexical-semantic relations - measures of semantic relatedness - wordnet construction - Polish WordNet - nouns - hypernymy extraction - supervised Machine Learning - classifiers - Rank Weight Function - filtering

Fulltext Preview

Image of the first page of the fulltext document