Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Mining Supervised Classification Performance Studies: A Meta-Analytic Investigation

Adrien JamainContact Information and David J. HandContact Information

(1)  BNP-Paribas, 10 Harewood Avenue, London, NW1 6AA, UK
(2)  Department of Mathematics, and Institute for Mathematical Sciences, Imperial College, London, SW7 2AZ, UK

Published online: 26 June 2008

Abstract  There have been many comparative studies of classification methods in which real datasets are used as a gauge to assess the relative performance of the methods. Since these comparisons often yield inconclusive or limited results on how methods perform, it is often believed that a broader approach combining these studies would shed some light on this difficult question. This paper describes such an attempt: we have sampled the available literature and created a dataset of 5807 classification results. We show that one of the possible ways to analyze the resulting data is an overall assessment of the classification methods, and we present methods for that particular aim. The merits and demerits of such an approach are discussed, and conclusions are drawn which may assist future research: we argue that the current state of the literature hardly allows large-scale investigations.

Keywords  Classification rules - Supervised classification - Neural networks - Tree classifiers - Logistic regression - Nearest neighbor method - Bradley-Terry - Meta-analysis - Data mining

This work was sponsored by the MOD Corporate Research Programme, CISP, as part of a larger project on technology assessment. We would like to express our appreciation to Andrew Webb for his support throughout the entire project, and to Wojtek Krzanowski for valuable comments on a draft of this paper. We would also like to thank the anonymous referees for some very interesting comments, some of which we hope to pursue in future work.

Contact Information Adrien Jamain (Corresponding author)
Email: adrien.jamain@uk.bnpparibas.com

Contact Information David J. Hand
Email: d.j.hand@imperial.ac.uk
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this article
Export this article as RIS | Text
 
Referenced by
3 newer articles

  1. Hand, D J (2009) Evaluating models for classifying customers in retail banking collections. Journal of the Operational Research Society
    [CrossRef]
  2. Hand, David J. (2009) Measuring classifier performance: a coherent alternative to the area under the ROC curve. Machine Learning
    [CrossRef]
  3. Jamain, Adrien (2009) Where are the large and difficult datasets?. Advances in Data Analysis and Classification
    [CrossRef]
Remote Address: 38.107.191.112 • Server: mpweb02
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)