Lecture Notes in Computer Science, 2006, Volume 4029/2006, 643-652, DOI: 10.1007/11785231_67

An Accurate MDS-Based Algorithm for the Visualization of Large Multidimensional Datasets

Antoine Naud

View Related Documents

Abstract

A common task in data mining is the visualization of multivariate objects on scatterplots, allowing human observers to perceive subtle inter-relations in the dataset such as outliers, groupings or other regularities. Least- squares multidimensional scaling (MDS) is a well known Exploratory Data Analysis family of techniques that produce dissimilarity or distance preserving layouts in a nonlinear way. In this framework, the issue of visualizing large multidimensional datasets through MDS-based methods is addressed. An original scheme providing very accurate layouts of large datasets is introduced. It is a compromise between the computational complexity O(N 5/2) and the accuracy of the solution that makes it suitable both for visualization of fairly large datasets and preprocessing in pattern recognition tasks.

Fulltext Preview

Image of the first page of the fulltext document