Volume 27, Number 4, 273-294, DOI: 10.1007/s10462-008-9082-5

An evaluation of dimension reduction techniques for one-class classification

Santiago D. Villalba and Pádraig Cunningham

From the issue entitled "Special Issue on the 18th Artificial Intelligence and Cognitive Science Conference (AICS 2007): Part I"

View Related Documents

Abstract

Dimension reduction (DR) is important in the processing of data in domains such as multimedia or bioinformatics because such data can be of very high dimension. Dimension reduction in a supervised learning context is a well posed problem in that there is a clear objective of discovering a reduced representation of the data where the classes are well separated. By contrast DR in an unsupervised context is ill posed in that the overall objective is less clear. Nevertheless successful unsupervised DR techniques such as principal component analysis (PCA) exist—PCA has the pragmatic objective of transforming the data into a reduced number of dimensions that still captures most of the variation in the data. While one-class classification falls somewhere between the supervised and unsupervised learning categories, supervised DR techniques appear not to be applicable at all for one-class classification because of the absence of a second class label in the training data. In this paper we evaluate the use of a number of up-to-date unsupervised DR techniques for one-class classification and we show that techniques based on cluster coherence and locality preservation are effective.

Keywords  One class classification - Dimensionality reduction - Feature selection - Feature transformation - Principal component analysis - Locality preservation - Cluster coherence

Fulltext Preview

Image of the first page of the fulltext document