Improved Learning Complexity in Combinatorial Pure Exploration Bandits

Victor Gabillon; Alessandro Lazaric; Mohammad Ghavamzadeh; Ronald Ortner; Peter Bartlett

Conference paper, Year: 2016

Victor Gabillon

Role: Author

Queensland University of Technology [Brisbane]

Alessandro Lazaric

Role: Author
PersonId : 851
IdHAL : alessandro-lazaric
ORCID : 0000-0002-8970-413X
IdRef : 188701486

Sequential Learning

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Mohammad Ghavamzadeh

Role: Author

Adobe Systems Inc.

Sequential Learning

Ronald Ortner

Role: Author

Montanuniversität Leoben

Peter Bartlett

Role: Author

Queensland University of Technology [Brisbane]

Abstract

We study the problem of combinatorial pure exploration in the stochastic multi-armed bandit problem. We first construct a new measure of complexity that provably characterizes the learning performance of the algorithms we propose for the fixed-confidence and fixed-budget settings. We show that this complexity is never higher than the one in existing work and illustrate a number of configurations in which it can be significantly smaller. While in general this improvement comes at the cost of increased computational complexity, we provide a series of examples, including a planning problem, where this extra cost is not significant.

Domains

Machine Learning [stat.ML]

Main file

AISTATS_full_CR.pdf (1.11 MB)

Origin: Files produced by the author(s)

Contributor: Alessandro Lazaric

https://inria.hal.science/hal-01322198

Submitted on: Thursday, May 26, 2016, 17:25:55

Last modified on: Wednesday, January 24, 2024, 09:54:23

Long-term archiving on: Saturday, August 27, 2016, 11:01:03

Dates and versions

hal-01322198, version 1 (26-05-2016)

Identifiers

HAL Id: hal-01322198, version 1

Cite

Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Ronald Ortner, Peter Bartlett. Improved Learning Complexity in Combinatorial Pure Exploration Bandits. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), May 2016, Cadiz, Spain. ⟨hal-01322198⟩

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE ANR

174 views

45 downloads