We focus on distribution-free transductive learning. In this setting the learning algorithm is given a ‘full sample’ of unlabeled points. Then, a training sample is selected
uniformly at random from the full sample and the labels of the training points are revealed. The goal is to predict the labels
of the remaining unlabeled points as accurately as possible. The full sample partitions the transductive hypothesis space
into a finite number of equivalence classes. All hypotheses in the same equivalence class, generate the same dichotomy of the full sample. We consider a large volume principle, whereby the priority of each equivalence class is proportional to its “volume” in the hypothesis space.
This is an extended abstract of an article published in the Machine Learning Journal [1].