This paper presents a new model developed by merging a non-parametric
k-nearest-neighbor (
kNN) preprocessor into an underlying support vector machine (SVM) to provide shelters for meaningful training examples, especially
for stray examples scattered around their counterpart examples with different class labels. Motivated by the method of adding
heavier penalty to the stray example to attain a stricter loss function for optimization, the model acts to shelter stray
examples. The model consists of a filtering
kNN emphasizer stage and a classical classification stage. First, the filtering
kNN emphasizer stage was employed to collect information from the training examples and to produce arbitrary weights for stray
examples. Then, an underlying SVM with parameterized real-valued class labels was employed to carry those weights, representing
various emphasized levels of the examples, in the classification. The emphasized weights given as heavier penalties changed
the regularization in the quadratic programming of the SVM, and brought the resultant decision function into a higher training
accuracy. The novel idea of real-valued class labels for conveying the emphasized weights provides an effective way to pursue
the solution of the classification inspired by the additional information. The adoption of the
kNN preprocessor as a filtering stage is effective since it is independent of SVM in the classification stage. Due to its property
of estimating density locally, the
kNN method has the advantage of distinguishing stray examples from regular examples by merely considering their circumstances
in the input space. In this paper, detailed experimental results and a simulated application are given to address the corresponding
properties. The results show that the model is promising in terms of its original expectations.
Keywords k-nearest-neighbor preprocessor - Stray training examples - Support vector machines - Classification - Pattern recognition