Inria - Institut national de recherche en sciences et technologies du numérique
Preprint, Working Paper. Year: 2020

Efficient Version Space Algorithms for "Human-in-the-Loop" Model Development

Abstract

When active learning (AL) is used to help a user develop a model on a large dataset by interactively presenting data instances for labeling, existing AL techniques can suffer from two main drawbacks: first, they may require hundreds of labeled data instances to reach high accuracy; second, retrieving the next instance to label can be time-consuming, which is incompatible with the interactive nature of the human exploration process. To address these issues, we introduce a novel version-space-based AL algorithm for kernel classifiers, which not only has strong theoretical guarantees on performance but also allows for an implementation that is efficient in time and space. In addition, by leveraging insights obtained during the user labeling process, we factorize the version space to perform active learning in a set of subspaces, which further reduces the user's labeling effort. Evaluation results show that our algorithms significantly outperform state-of-the-art version space algorithms, as well as a recent factorization-aware algorithm, for model development over large datasets.
Main file
Submission-2020-09.pdf (1.22 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03064769, version 1 (14-12-2020)

Identifiers

  • HAL Id: hal-03064769, version 1

Cite

Luciano Di Palma, Yanlei Diao, Anna Liu. Efficient Version Space Algorithms for "Human-in-the-Loop" Model Development. 2020. ⟨hal-03064769⟩
