Boosting Active Learning to Optimality: a Tractable Monte-Carlo, Billiard-based Algorithm

Philippe Rolet; Michèle Sebag; Olivier Teytaud

Conference Papers Year : 2009

Boosting Active Learning to Optimality: a Tractable Monte-Carlo, Billiard-based Algorithm

(1) , (2) , (1, 2, 3)

1
2
3

Philippe Rolet

Function : Author
PersonId : 858981

Laboratoire de Recherche en Informatique

Michèle Sebag

Function : Author
PersonId : 836537

Machine Learning and Optimisation

Olivier Teytaud

Function : Author
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Algorithmic number theory for cryptology

Abstract

Abstract. This paper focuses on Active Learning with a limited num- ber of queries; in application domains such as Numerical Engineering, the size of the training set might be limited to a few dozen or hundred exam- ples due to computational constraints. Active Learning under bounded resources is formalized as a ﬁnite horizon Reinforcement Learning prob- lem, where the sampling strategy aims at minimizing the expectation of the generalization error. A tractable approximation of the optimal (in- tractable) policy is presented, the Bandit-based Active Learner (BAAL) algorithm. Viewing Active Learning as a single-player game, BAAL com- bines UCT, the tree structured multi-armed bandit algorithm proposed by Kocsis and Szepesv´ri (2006), and billiard algorithms. A proof of a principle of the approach demonstrates its good empirical convergence toward an optimal policy and its ability to incorporate prior AL crite- ria. Its hybridization with the Query-by-Committee approach is found to improve on both stand-alone BAAL and stand-alone QbC.

Domains

Optimization and Control [math.OC]

Fichier principal

BALO.pdf (211.35 Ko)

Origin : Files produced by the author(s)

Olivier Teytaud : Connect in order to contact the contributor

https://inria.hal.science/inria-00433866

Submitted on : Friday, November 20, 2009-1:17:05 PM

Last modification on : Wednesday, April 17, 2024-2:05:15 PM

Long-term archiving on: Thursday, June 30, 2011-11:57:56 AM

Dates and versions

inria-00433866 , version 1 (20-11-2009)

Identifiers

HAL Id : inria-00433866 , version 1

Cite

Philippe Rolet, Michèle Sebag, Olivier Teytaud. Boosting Active Learning to Optimality: a Tractable Monte-Carlo, Billiard-based Algorithm. ECML, 2009, Bled, Slovenia. pp.302-317. ⟨inria-00433866⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X EC-PARIS CNRS INRIA LIX X-LIX X-DEP-INFO UMR8623 INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY

6319 View

1006 Download

Boosting Active Learning to Optimality: a Tractable Monte-Carlo, Billiard-based Algorithm

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share