Volume 58, Number 1, 39-77, DOI: 10.1007/s10994-005-5011-x

ROC ‘n’ Rule Learning—Towards a Better Understanding of Covering Algorithms

Johannes Fürnkranz and Peter A. Flach

View Related Documents

Abstract

This paper provides an analysis of the behavior of separate-and-conquer or covering rule learning algorithms by visualizing their evaluation metrics and their dynamics in coverage space, a variant of ROC space. Our results show that most commonly used metrics, including accuracy, weighted relative accuracy, entropy, and Gini index, are equivalent to one of two fundamental prototypes: precision, which tries to optimize the area under the ROC curve for unknown costs, and a cost-weighted difference between covered positive and negative examples, which tries to find the optimal point under known or assumed costs. We also show that a straightforward generalization of the m-estimate trades off these two prototypes. Furthermore, our results show that stopping and filtering criteria like CN2rsquos significance test focus on identifying significant deviations from random classification, which does not necessarily avoid overfitting. We also identify a problem with Foilrsquos MDL-based encoding length restriction, which proves to be largely equivalent to a variable threshold on the recall of the rule. In general, we interpret these results as evidence that, contrary to common conception, pre-pruning heuristics are not very well understood and deserve more investigation.

inductive rule learning - ROC analysis - rule evaluation metrics - coverage space

Fulltext Preview

Image of the first page of the fulltext document