Anytime many-armed bandits

Olivier Teytaud; Sylvain Gelly; Michèle Sebag

Conference Papers Year : 2007

Anytime many-armed bandits

(1) , (1) , (1)

Olivier Teytaud

Function : Author
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Algorithmic number theory for cryptology

Sylvain Gelly

Function : Author

Algorithmic number theory for cryptology

Michèle Sebag

Function : Author
PersonId : 836537

Algorithmic number theory for cryptology

Abstract

This paper introduces the many-armed bandit problem (ManAB), where the number of arms is large comparatively to the relevant number of time steps. While the ManAB framework is relevant to many real-world applications, the state of the art does not offer anytime algorithms handling ManAB problems. Both theory and practice suggest that two problem categories must be distinguished; the easy category includes those problems where good arms have reward probability close to 1; the difficult category includes other problems. Two algorithms termed FAILURE and MUCBT are proposed for the ManAB framework. FAILURE and its variants extend the non-anytime approach proposed for the denumerable-armed bandit and non-asymptotic bounds are shown; it works very efficiently for easy ManAB problems. Meanwhile, MUCBT efficiently deals with difficult ManAB problems.

Domains

Computer Science and Game Theory [cs.GT]

Fichier principal

mabcap2.pdf (972.9 Ko)

Origin : Files produced by the author(s)

Olivier Teytaud : Connect in order to contact the contributor

https://inria.hal.science/inria-00173263

Submitted on : Wednesday, September 19, 2007-2:20:51 PM

Last modification on : Friday, March 24, 2023-2:52:49 PM

Long-term archiving on: Thursday, April 8, 2010-7:46:10 PM

Dates and versions

inria-00173263 , version 1 (19-09-2007)

Identifiers

HAL Id : inria-00173263 , version 1

Cite

Olivier Teytaud, Sylvain Gelly, Michèle Sebag. Anytime many-armed bandits. CAP07, 2007, Grenoble, France. ⟨inria-00173263⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X CNRS INRIA LIX X-LIX X-DEP-INFO PARISTECH INRIA2

303 View

392 Download

Anytime many-armed bandits

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share