Lecture Notes in Computer Science, 2003, Volume 2663/2003, 955, DOI: 10.1007/3-540-44831-4_25

Carrot2 and Language Properties in Web Search Results Clustering

Jerzy Stefanowski and Dawid Weiss

View Related Documents

Abstract

This paper relates to a technique of improving results visualization in Web search engines known as search results clustering. We introduce an open extensible research system for examination and development of search results clustering algorithms — Carrot2. We also discuss attempts to measuring quality of discovered clusters and demonstrate results of our experiments with quality assessment when inflectionally rich language (Polish) is clustered using a representative algorithm - Suffix Tree Clustering.

Keywords  information retrieval - web browsing and exploration - web search clustering - suffix tree clustering

Fulltext Preview

Image of the first page of the fulltext document