Support Vector Machines, Kernel Logistic Regression and Boosting
Ji Zhu6
and Trevor Hastie6 
| (6) |
Department of Statistics, Stanford University, Stanford, CA, 94305 |
Abstract
The support vector machine is known for its excellent performance in binary classification, i.e., the response y ∈ −1, 1, but its appropriate extension to the multi-class case is still an on-going research issue. Another weakness of the
SVM is that it only estimates sign[p(x) − 1/2], while the probability p(x) is often of interest itself, where p(x) = P(Y = 1∣X = x) is the conditional probability of a point being in class 1 given X = x. We propose a new approach for classification, called the import vector machine, which is built on kernel logistic regression
(KLR). We show on some examples that the IVM performs as well as the SVM in binary classification. The IVM can naturally be
generalized to the multi-class case. Furthermore, the IVM provides an estimate of the underlying class probabilities. Similar
to the “support points” of the SVM, the IVM model uses only a fraction of the training data to index kernel basis functions,
typically a much smaller fraction than the SVM. This can give the IVM a computational advantage over the SVM, especially when
the size of the training data set is large. We illustrate these techniques on some examples, and make connections with boosting,
another popular machine-learning method for classification.
Keywords classification - kernel methods - logistic regression - multi-class learning - radial basis - reproducing kernel Hilbert space (RKHS) - support vector machines
References secured to subscribers.