Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Machine Learning Research Année : 2021

Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals

Wouter M. Koolen
  • Fonction : Auteur
  • PersonId : 976655

Résumé

This paper presents new deviation inequalities that are valid uniformly in time under adaptive sampling in a multi-armed bandit model. The deviations are measured using the Kullback-Leibler divergence in a given one-dimensional exponential family, and may take into account several arms at a time. They are obtained by constructing for each arm a mixture martingale based on a hierarchical prior, and by multiplying those martingales. Our deviation inequalities allow us to analyze stopping rules based on generalized likelihood ratios for a large class of sequential identification problems, and to construct tight confidence intervals for some functions of the means of the arms.
Fichier principal
Vignette du fichier
KK21.pdf (138.71 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01886612 , version 1 (03-10-2018)
hal-01886612 , version 2 (27-11-2018)
hal-01886612 , version 3 (07-12-2021)

Identifiants

Citer

Emilie Kaufmann, Wouter M. Koolen. Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals. Journal of Machine Learning Research, 2021. ⟨hal-01886612v3⟩
204 Consultations
592 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More