This paper reviews the appropriateness for application to large data sets of standard machine learning algorithms, which were
mainly developed in the context of small data sets. Sampling and parallelisation have proved useful means for reducing computation
time when learning from large data sets. However, such methods assume that algorithms that were designed for use with what
are now considered small data sets are also fundamentally suitable for large data sets. It is plausible that optimal learning
from large data sets requires a different type of algorithm to optimal learning from small data sets. This paper investigates
one respect in which data set size may affect the requirements of a learning algorithm — the bias plus variance decomposition
of classification error. Experiments show that learning from large data sets may be more effective when using an algorithm
that places greater emphasis on bias management, rather than variance management.