Rethinking deep active learning: Using unlabeled data at model training

Oriane Siméoni; Mateusz Budnik; Yannis Avrithis; Guillaume Gravier

Conference paper, Year: 2021


Oriane Siméoni

Role: Author

Creating and exploiting explicit links between multimedia fragments

Mateusz Budnik

Role: Author

Creating and exploiting explicit links between multimedia fragments

Yannis Avrithis

Role: Author
PersonId: 20705
IdHAL: yannis-avrithis
ORCID: 0000-0001-7476-4482
IdRef: 253126193

Creating and exploiting explicit links between multimedia fragments

Guillaume Gravier

Role: Author
PersonId: 1046
IdHAL: guig
ORCID: 0000-0002-2266-5682
IdRef: 110355415

Creating and exploiting explicit links between multimedia fragments

Abstract

Active learning typically focuses on training a model on a few labeled examples alone, while unlabeled ones are only used for acquisition. In this work we depart from this setting by using both labeled and unlabeled data during model training across active learning cycles. We do so by using unsupervised feature learning at the beginning of the active learning pipeline and semi-supervised learning at every active learning cycle, on all available data. The former has not been investigated before in active learning, while the study of the latter in the context of deep learning is scarce, and recent findings are not conclusive with respect to its benefit. Our idea is orthogonal to acquisition strategies: it uses more data, much like ensemble methods use more models. By systematically evaluating on a number of popular acquisition strategies and datasets, we find that the use of unlabeled data during model training brings a surprising accuracy improvement in image classification, compared to the differences between acquisition strategies. We thus explore smaller label budgets, even one label per class.
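
The training schedule described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation, and every name in it (`make_blobs`, `train`, `active_learning`, and so on) is hypothetical. It assumes a nearest-centroid classifier on 2D points in place of a deep network, iterative pseudo-labeling as a stand-in for semi-supervised learning, and a smallest-margin rule as a stand-in acquisition strategy. The unsupervised feature learning step at the start of the pipeline is omitted. The initial label set holds one label per class, matching the smallest budget mentioned above.

```python
import random

def make_blobs(seed=0, n_per_class=100):
    """Toy data: two Gaussian blobs in 2D, one per class (an assumption for illustration)."""
    rng = random.Random(seed)
    points, oracle = [], []
    for label, cx in enumerate((-2.0, 2.0)):
        for _ in range(n_per_class):
            points.append((rng.gauss(cx, 1.0), rng.gauss(0.0, 1.0)))
            oracle.append(label)
    return points, oracle

def centroids(points, labels):
    """Mean point per class, given a dict mapping point index -> label."""
    sums = {}
    for i, y in labels.items():
        sx, sy, n = sums.get(y, (0.0, 0.0, 0))
        sums[y] = (sx + points[i][0], sy + points[i][1], n + 1)
    return {y: (sx / n, sy / n) for y, (sx, sy, n) in sums.items()}

def predict(p, cents):
    """Return (predicted class, margin); a small margin marks an uncertain point."""
    d = sorted(((p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2, y) for y, c in cents.items())
    margin = d[1][0] - d[0][0] if len(d) > 1 else float("inf")
    return d[0][1], margin

def train(points, labeled, use_unlabeled, rounds=3):
    """Fit centroids on labeled data; optionally refine them with pseudo-labels."""
    cents = centroids(points, labeled)
    if not use_unlabeled:
        return cents  # conventional setting: labeled examples only
    for _ in range(rounds):
        pseudo = dict(labeled)  # true labels always override pseudo-labels
        for i, p in enumerate(points):
            if i not in labeled:
                pseudo[i] = predict(p, cents)[0]
        cents = centroids(points, pseudo)  # retrain on all available data
    return cents

def active_learning(points, oracle, labeled, cycles, budget, use_unlabeled=True):
    """Each cycle: train, pick the `budget` most uncertain pool points, label them."""
    labeled = dict(labeled)  # do not mutate the caller's initial label set
    for _ in range(cycles):
        cents = train(points, labeled, use_unlabeled)
        pool = [i for i in range(len(points)) if i not in labeled]
        pool.sort(key=lambda i: predict(points[i], cents)[1])
        for i in pool[:budget]:
            labeled[i] = oracle[i]  # query the oracle for the true label
    return train(points, labeled, use_unlabeled), labeled

def accuracy(points, oracle, cents):
    return sum(predict(p, cents)[0] == y for p, y in zip(points, oracle)) / len(points)

points, oracle = make_blobs()
initial = {0: oracle[0], 100: oracle[100]}  # one label per class
for flag in (False, True):
    cents, labeled = active_learning(points, oracle, initial, 3, 2, use_unlabeled=flag)
    print(flag, len(labeled), accuracy(points, oracle, cents))
```

Switching `use_unlabeled` changes only the training step and leaves the acquisition rule untouched, which mirrors the abstract's point that using unlabeled data is orthogonal to the choice of acquisition strategy.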

Domains

Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]

Main file

1911.08177.pdf (693.32 KB)

Origin: Files produced by the author(s)

Contributor: Yannis Avrithis

https://inria.hal.science/hal-02372102

Submitted on: Wednesday, November 20, 2019, 11:26:38

Last modified on: Friday, March 24, 2023, 14:53:13

Dates and versions

hal-02372102, version 1 (20-11-2019)

Identifiers

HAL Id: hal-02372102, version 1
arXiv: 1911.08177

Cite

Oriane Siméoni, Mateusz Budnik, Yannis Avrithis, Guillaume Gravier. Rethinking deep active learning: Using unlabeled data at model training. ICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milan, Italy. pp.1-12. ⟨hal-02372102⟩


Collections

UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM CYBERSCHOOL

102 views

346 downloads
