Lecture Notes in Computer Science, 2002, Volume 2406/2002, 278-287, DOI: 10.1007/3-540-45691-0_25

Stemming Evaluated in 6 Languages by Hummingbird SearchServer™ at CLEF 2001

Stephen Tomlinson

Abstract

Hummingbird submitted ranked result sets for all 5 Monolingual Information Retrieval tasks (German, French, Italian, Spanish and Dutch) of the Cross-Language Evaluation Forum (CLEF) 2001. SearchServer's Intuitive Searching™ produced the highest average precision score in the German task of the 12 groups submitting automatic, Title+Description runs. Enabling stemming in SearchServer increased average precision by 43% in German, 30% in Dutch, 18% in French, 16% in Italian, 12% in Spanish and 12% in English. All points in the 95% confidence interval for the impact of stemming on average precision in German (based on the two-sided Wilcoxon signed rank test) were greater than all points in the corresponding intervals for French, English and Italian, which is evidence that stemming is more beneficial in German than in French, English or Italian.
