On Elimination Strategies for Bandit Fixed-Confidence Identification

Andrea Tirinzoni; Rémy Degenne

Communication Dans Un Congrès Année : 2022

On Elimination Strategies for Bandit Fixed-Confidence Identification

(1) , (2)

1
2

Andrea Tirinzoni

Fonction : Auteur

Meta AI

Rémy Degenne

Fonction : Auteur
PersonId : 748911
IdHAL : remydegenne

Scool

Résumé

Elimination algorithms for bandit identification, which prune the plausible correct answers sequentially until only one remains, are computationally convenient since they reduce the problem size over time. However, existing elimination strategies are often not fully adaptive (they update their sampling rule infrequently) and are not easy to extend to combinatorial settings, where the set of answers is exponentially large in the problem dimension. On the other hand, most existing fully-adaptive strategies to tackle general identification problems are computationally demanding since they repeatedly test the correctness of every answer, without ever reducing the problem size. We show that adaptive methods can be modified to use elimination in both their stopping and sampling rules, hence obtaining the best of these two worlds: the algorithms (1) remain fully adaptive, (2) suffer a sample complexity that is never worse of their non-elimination counterpart, and (3) provably eliminate certain wrong answers early. We confirm these benefits experimentally, where elimination improves significantly the computational complexity of adaptive methods on common tasks like best-arm identification in linear bandits.

Domaines

Machine Learning [stat.ML]

Rémy Degenne : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03830692

Soumis le : mercredi 26 octobre 2022-14:50:54

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-03830692 , version 1 (26-10-2022)

Identifiants

HAL Id : hal-03830692 , version 1
ARXIV : 2205.10936

Citer

Andrea Tirinzoni, Rémy Degenne. On Elimination Strategies for Bandit Fixed-Confidence Identification. NeurIPS 2022 - 36th Conference on Neural Information Processing System, Nov 2022, New Orleans, United States. ⟨hal-03830692⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-LILLE CRISTAL-SCOOL

23 Consultations

0 Téléchargements

On Elimination Strategies for Bandit Fixed-Confidence Identification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager