Efficient Version Space Algorithms for "Human-in-the-Loop" Model Development - Archive ouverte HAL Access content directly
Preprints, Working Papers, ... Year :

Efficient Version Space Algorithms for "Human-in-the-Loop" Model Development

(1, 2) , (1, 2) , (3)
1
2
3

Abstract

When active learning (AL) is applied to help the user develop a model on a large dataset through interactively presenting data instances for labeling, existing AL techniques can suffer from two main drawbacks: first, they may require hundreds of labeled data instances in order to reach high accuracy; second, retrieving the next instance to label can be time consuming, making it incompatible with the interactive nature of the human exploration process. To address these issues, we introduce a novel version space based AL algorithm for kernel classifiers, which not only has strong theoretical guarantees on performance, but also allows for an efficient implementation in time and space. In addition, by leveraging additional insights obtained in the user labeling process, we are able to factorize the version space to perform active learning in a set of subspaces, which further reduces the user labeling effort. Evaluation results show that our algorithms significantly outperform state-of-theart version space algorithms, as well as a recent factorization-aware algorithm, for model development over large data sets.
Fichier principal
Vignette du fichier
Submission-2020-09.pdf (1.22 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03064769 , version 1 (14-12-2020)

Identifiers

  • HAL Id : hal-03064769 , version 1

Cite

Luciano Di Palma, Yanlei Diao, Anna Liu. Efficient Version Space Algorithms for "Human-in-the-Loop" Model Development. 2020. ⟨hal-03064769⟩
80 View
123 Download

Share

Gmail Facebook Twitter LinkedIn More