Tristan Mary-Huard (AgroParisTech, UMR INRA/AgroParisTech MIA 518)

Exact Cross-Validation for kNN : application to passive and active learning in classification

vendredi 23 mars 2012, 9h45 - 11h00

Salle de réunion, espace Turing

In the binary classification framework, a closed form expression of the cross-validation Leave-p-Out (LpO)

risk estimator for the k Nearest Neighbor algorithm (kNN) is derived. It is first used to study the LpO risk minimization

strategy for choosing k in the passive learning setting. The impact of p on the choice of k and the LpO estimation of

the risk are inferred. In the active learning setting, a procedure is proposed that selects new examples using a LpO

committee of kNN classifiers. The influence of p on the choice of new examples and the tuning of k at each step is

investigated. The behavior of k chosen by LpO is shown to be different from what is observed in passive learning.