Infinitely many-armed bandits - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Infinitely many-armed bandits

Résumé

We consider multi-armed bandit problems where the number of arms is larger than the possible number of experiments. We make a stochastic assumption on the mean-reward of a new selected arm which characterizes its probability of being a near-optimal arm. Our assumption is weaker than in previous works. We describe algorithms based on upper-confidence-bounds applied to a restricted set of randomly selected arms and provide upper-bounds on the resulting expected regret. We also derive a lower-bound which matches (up to a logarithmic factor) the upper-bound in some cases.
Fichier principal
Vignette du fichier
many-armed.pdf (145.07 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00830178 , version 1 (04-06-2013)

Identifiants

  • HAL Id : hal-00830178 , version 1

Citer

Yizao Wang, Jean-Yves Audibert, Rémi Munos. Infinitely many-armed bandits. Advances in Neural Information Processing Systems, 2008, Canada. ⟨hal-00830178⟩
608 Consultations
215 Téléchargements

Partager

Gmail Facebook X LinkedIn More