Lecture Notes in Computer Science, 2005, Volume 3739/2005, 464-474, DOI: 10.1007/11563952_41

An Auto-stopped Hierarchical Clustering Algorithm Integrating Outlier Detection Algorithm

Tian-yang Lv, Tai-xue Su, Zheng-xuan Wang and Wan-li Zuo

Abstract

Selecting appropriate parameter values is a critical problem for clustering analysis techniques. Moreover, clustering algorithms lack an effective mechanism for detecting outliers and typically treat them as “noise”. Regarding outliers as valuable information, this paper proposes a novel hierarchical clustering algorithm that integrates a new outlier-mining method. The algorithm stops clustering according to the dissimilarity reflected by the detected outliers and needs only one parameter, whose appropriate value can be determined during the outlier-mining process. After discussing some related topics, the paper uses five real-life datasets to evaluate the algorithm's performance in outlier mining and clustering and compares it with other algorithms.

Keywords  Clustering - Outlier Mining - Auto Stop
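
The abstract does not spell out the stopping rule or the outlier-mining method, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes single-linkage agglomerative clustering with one hypothetical parameter, stop_distance: merging stops once the closest pair of clusters is farther apart than that value, and any cluster still holding a single point is reported as an outlier. The names cluster_with_auto_stop, single_link, and stop_distance are illustrative and do not come from the paper.

```python
from itertools import combinations
from math import dist


def single_link(a, b):
    """Smallest point-to-point distance between clusters a and b."""
    return min(dist(p, q) for p in a for q in b)


def cluster_with_auto_stop(points, stop_distance):
    """Merge closest clusters until the smallest gap exceeds stop_distance.

    Assumption: stop_distance stands in for the paper's single parameter;
    the paper derives its value from outlier mining, which is not modeled here.
    """
    # Start with every point in its own cluster.
    clusters = [[tuple(p)] for p in points]
    while len(clusters) > 1:
        # Find the closest pair of clusters (i < j by construction).
        (i, j), gap = min(
            (((i, j), single_link(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        # Stop once even the closest pair is too dissimilar to merge.
        if gap > stop_distance:
            break
        clusters[i].extend(clusters.pop(j))
    # Points that never merged are treated as outliers in this sketch.
    outliers = [c[0] for c in clusters if len(c) == 1]
    groups = [c for c in clusters if len(c) > 1]
    return groups, outliers
```

Calling cluster_with_auto_stop(points, 2.0) on a list of 2-D coordinate tuples returns the multi-point clusters and the leftover singleton points. The pairwise search is quadratic per merge, which keeps the sketch short but would be too slow for large datasets.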
