Volume 4, Number 2, 227-243, DOI: 10.1023/A:1022604100933

An Empirical Comparison of Pruning Methods for Decision Tree Induction

John Mingers

Abstract

This paper compares five methods for pruning decision trees, developed from sets of examples. When used with uncertain rather than deterministic data, decision-tree induction involves three main stages—creating a complete tree able to classify all the training examples, pruning this tree to give statistical reliability, and processing the pruned tree to improve understandability. This paper concerns the second stage—pruning. It presents empirical comparisons of the five methods across several domains. The results show that three methods—critical value, error complexity and reduced error—perform well, while the other two may cause problems. They also show that there is no significant interaction between the creation and pruning methods.
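
The abstract names reduced error pruning but does not describe it. As a rough illustration of the pruning stage only, the sketch below follows the commonly described form of that method: working bottom-up, a subtree is replaced by a leaf whenever doing so does not increase the error count on a separate pruning set. It is not taken from the paper. The `Node` class, the helper names, and the toy data are all hypothetical, and sending unseen attribute values to the node's majority class is an assumption made for brevity.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Hypothetical representation: (attribute values, class label).
Example = Tuple[Dict[str, str], str]


@dataclass
class Node:
    majority: str                               # majority training class at this node
    attribute: Optional[str] = None             # None marks a leaf
    children: Optional[Dict[str, "Node"]] = None

    def is_leaf(self) -> bool:
        return self.attribute is None


def classify(node: Node, x: Dict[str, str]) -> str:
    """Walk down the tree; fall back to the majority class on an unseen value."""
    while not node.is_leaf():
        child = node.children.get(x.get(node.attribute))
        if child is None:
            return node.majority
        node = child
    return node.majority


def errors(node: Node, examples: List[Example]) -> int:
    """Count misclassified examples."""
    return sum(classify(node, x) != y for x, y in examples)


def reduced_error_prune(node: Node, pruning_set: List[Example]) -> Node:
    """Return the tree with subtrees replaced by leaves where that does not hurt."""
    if node.is_leaf():
        return node
    # Prune the children first, giving each child only the examples that reach it.
    for value, child in list(node.children.items()):
        subset = [(x, y) for x, y in pruning_set if x.get(node.attribute) == value]
        node.children[value] = reduced_error_prune(child, subset)
    # Replace this subtree by a leaf if the leaf makes no more errors than the subtree.
    leaf = Node(majority=node.majority)
    if errors(leaf, pruning_set) <= errors(node, pruning_set):
        return leaf
    return node


# Toy tree: the "windy" split under "sunny" fits noise in the training data.
tree = Node(
    majority="no",
    attribute="outlook",
    children={
        "sunny": Node(
            majority="yes",
            attribute="windy",
            children={"yes": Node(majority="no"), "no": Node(majority="yes")},
        ),
        "rain": Node(majority="no"),
    },
)

pruning_set: List[Example] = [
    ({"outlook": "sunny", "windy": "yes"}, "yes"),
    ({"outlook": "sunny", "windy": "no"}, "yes"),
    ({"outlook": "rain", "windy": "no"}, "no"),
    ({"outlook": "rain", "windy": "yes"}, "no"),
]

pruned = reduced_error_prune(tree, pruning_set)
```

On this toy data the "windy" subtree collapses to a single "yes" leaf, because the leaf makes no errors on the pruning examples that reach it while the subtree makes one. The root split on "outlook" is kept, since replacing it by a leaf would add errors. Note that the function modifies the tree it is given, so copy the tree first if the unpruned version is still needed.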

Keywords

Decision trees - Knowledge acquisition - Uncertain data - Pruning
