Volume 21, Number 1, 83-101, DOI: 10.1007/s11002-009-9083-4

Evaluation of structure and reproducibility of cluster solutions using the bootstrap

Sara Dolnicar and Friedrich Leisch

View Related Documents

Abstract

Segmentation results derived using cluster analysis depend on (1) the structure of the data and (2) algorithm parameters. Typically, neither the data structure nor the sensitivity of the analysis to changes in algorithm parameters is assessed in advance of clustering. We propose a benchmarking framework based on bootstrapping techniques that accounts for sample and algorithm randomness. This provides much needed guidance both to data analysts and users of clustering solutions regarding the choice of the final clusters from computations that are exploratory in nature.

Keywords  Cluster analysis - Mixture models - Bootstrap

Fulltext Preview

Image of the first page of the fulltext document