Positive and Unlabeled Examples Help Learning

Françesco de Comité; François Denis; Rémi Gilleron; Fabien Letouzey

Communication Dans Un Congrès Année : 1999

Positive and Unlabeled Examples Help Learning

(1) , (1) , (1) , (1)

Françesco de Comité

Fonction : Auteur
PersonId : 176211
IdHAL : francesco-de-comite
ORCID : 0000-0002-9625-7059

Groupe de Recherche en Apprentissage Automatique

François Denis

Fonction : Auteur
PersonId : 832393

Groupe de Recherche en Apprentissage Automatique

Rémi Gilleron

Fonction : Auteur
PersonId : 184332
IdHAL : remi-gilleron
ORCID : 0000-0002-1583-5938
IdRef : 061168718

Groupe de Recherche en Apprentissage Automatique

Fabien Letouzey

Fonction : Auteur

Groupe de Recherche en Apprentissage Automatique

Résumé

In many learning problems, labeled examples are rare or expensive while numerous unlabeled and positive examples are available. However, most learning algorithms only use labeled examples. Thus we address the problem of learning with the help of positive and unlabeled data given a small number of labeled examples. We present both theoretical and empirical arguments showing that learning algorithms can be improved by the use of both unlabeled and positive data. As an illustrating problem, we consider the learning algorithm from statistics for monotone conjunctions in the presence of classification noise and give empirical evidence of our assumptions. We give theoretical results for the improvement of Statistical Query learning algorithms from positive and unlabeled data. Lastly, we apply these ideas to tree induction algorithms. We modify the code of C4.5 to get an algorithm which takes as input a set LAB of labeled examples, a set POS of positive examples and a set UNL of unlabeled data and which uses these three sets to construct the decision tree. We provide experimental results based on data taken from UCI repository which confirm the relevance of this approach.

Domaines

Langage de programmation [cs.PL]

Rémi Gilleron : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00538885

Soumis le : mardi 23 novembre 2010-14:48:40

Dernière modification le : vendredi 24 mars 2023-14:52:53

Dates et versions

inria-00538885 , version 1 (23-11-2010)

Identifiants

HAL Id : inria-00538885 , version 1

Citer

Françesco de Comité, François Denis, Rémi Gilleron, Fabien Letouzey. Positive and Unlabeled Examples Help Learning. Proceedings of the Tenth International Conference on Algorithmic Learning Theory, ALT'99, 1999, Tokyo, Japan. pp.219--230. ⟨inria-00538885⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS LIFL

192 Consultations

0 Téléchargements

Positive and Unlabeled Examples Help Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager