Nearest neighbor classification in infinite dimension

Frédéric Cérou; Arnaud Guyader

Rapport (Rapport De Recherche) Année : 2005

Nearest neighbor classification in infinite dimension

(1) , (1)

Frédéric Cérou

Fonction : Auteur
PersonId : 830625

Applications of interacting particle systems to statistics

Arnaud Guyader

Fonction : Auteur
PersonId : 857058

Applications of interacting particle systems to statistics

Résumé

Let $X$ be a random element in a metric space $(\calF,d)$, and let $Y$ be a random variable with value $0$ or $1$. $Y$ is called the class, or the label, of $X$. Assume $n$ i.i.d. copies $(X_i,Y_i)_1\leqi\leqn$. The problem of classification is to predict the label of a new random element $X$. The $k$-nearest neighbor classifier consists in the simple following rule : look at the $k$ nearest neighbors of $X$ and choose $0$ or $1$ for its label according to the majority vote. If $(\calF,d)=(R^d,||.||)$, Stone has proved in 1977 the universal consistency of this classifier : its probability of error converges to the Bayes error, whatever the distribution of $(X,Y)$. We show in this paper that this result is no more valid in general metric spaces. However, if $(\calF,d)$ is separable and if a regularity condition is assumed, then the $k$-nearest neighbor classifier is weakly consistent.

Mots clés

CLASSIFICATION CONSISTENCY NON PARAMETRIC STATISTICS

Domaines

Autre [cs.OH]

Fichier principal

RR-5536.pdf (289.71 Ko)

Rapport De Recherche Inria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00070470

Soumis le : vendredi 19 mai 2006-20:35:52

Dernière modification le : vendredi 24 mars 2023-14:52:47

Archivage à long terme le : dimanche 4 avril 2010-21:17:38

Dates et versions

inria-00070470 , version 1 (19-05-2006)

Identifiants

HAL Id : inria-00070470 , version 1

Citer

Frédéric Cérou, Arnaud Guyader. Nearest neighbor classification in infinite dimension. [Research Report] RR-5536, INRIA. 2005, pp.23. ⟨inria-00070470⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UR2-HB CNRS INRIA IRISA INRIA-RRRT IRISA-D5 INRIA2 LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

8843 Consultations

462 Téléchargements

Nearest neighbor classification in infinite dimension

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager