Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

Lilian Besson; Emilie Kaufmann; Odalric-Ambrym Maillard; Julien Seznec

Article Dans Une Revue Journal of Machine Learning Research Année : 2022

Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

(1) , (2) , (2) , (2, 3)

1
2
3

Lilian Besson

Fonction : Auteur
PersonId : 14893
IdHAL : lilian-besson
ORCID : 0000-0003-2767-2563
IdRef : 24252883X

École normale supérieure - Rennes

Emilie Kaufmann

Fonction : Auteur
PersonId : 10422
IdHAL : emilie-kaufmann
ORCID : 0000-0002-5496-824X
IdRef : 197040810

Scool

Odalric-Ambrym Maillard

Fonction : Auteur
PersonId : 5563
IdHAL : odalric-ambrym-maillard
ORCID : 0000-0001-7935-7026
IdRef : 158055594

Scool

Julien Seznec

Fonction : Auteur
PersonId : 1084851

Scool

Lelivrescolaire.fr

Résumé

We introduce GLR-klUCB, a novel algorithm for the piecewise iid non-stationary bandit problem with bounded rewards. This algorithm combines an efficient bandit algorithm, kl-UCB, with an efficient, parameter-free, changepoint detector, the Bernoulli Generalized Likelihood Ratio Test, for which we provide new theoretical guarantees of independent interest. Unlike previous non-stationary bandit algorithms using a change-point detector, GLR-klUCB does not need to be calibrated based on prior knowledge on the arms' means. We prove that this algorithm can attain a $O(\sqrt{TA \Upsilon_T\log(T)})$ regret in $T$ rounds on some ``easy'' instances, where A is the number of arms and $\Upsilon_T$ the number of change-points, without prior knowledge of $\Upsilon_T$. In contrast with recently proposed algorithms that are agnostic to $\Upsilon_T$, we perform a numerical study showing that GLR-klUCB is also very efficient in practice, beyond easy instances.

Mots clés

Multi-Armed Bandits Non-Stationary Bandits Change Point Detection

Domaines

Autres [stat.ML]

Fichier principal

BKMS22 (1).pdf (668.02 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Emilie Kaufmann : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02006471

Soumis le : lundi 1 août 2022-09:44:13

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-02006471 , version 1 (04-02-2019)

hal-02006471 , version 2 (08-12-2020)

hal-02006471 , version 3 (01-08-2022)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

HAL Id : hal-02006471 , version 3
ARXIV : 1902.01575

Citer

Lilian Besson, Emilie Kaufmann, Odalric-Ambrym Maillard, Julien Seznec. Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits. Journal of Machine Learning Research, 2022. ⟨hal-02006471v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-RENNES UNIV-LILLE CRISTAL-SCOOL ANR

617 Consultations

894 Téléchargements

Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager