Choosing Answers in epsilon-Best-Answer Identification for Linear Bandits

Rémy Degenne; Marc Jourdan

Communication Dans Un Congrès Année : 2022

Choosing Answers in epsilon-Best-Answer Identification for Linear Bandits

(1) , (1)

Rémy Degenne

Fonction : Auteur
PersonId : 748911
IdHAL : remydegenne

Scool

Marc Jourdan

Fonction : Auteur
PersonId : 1177653
IdHAL : marc-jourdan
ORCID : 0000-0002-2449-4549

Scool

Résumé

In pure-exploration problems, information is gathered sequentially to answer a question on the stochastic environment. While best-arm identification for linear bandits has been extensively studied in recent years, few works have been dedicated to identifying one arm that is $\varepsilon$-close to the best one (and not exactly the best one). In this problem with several correct answers, an identification algorithm should focus on one candidate among those answers and verify that it is correct. We demonstrate that picking the answer with highest mean does not allow an algorithm to reach asymptotic optimality in terms of expected sample complexity. Instead, a \textit{furthest answer} should be identified. Using that insight to choose the candidate answer carefully, we develop a simple procedure to adapt best-arm identification algorithms to tackle $\varepsilon$-best-answer identification in transductive linear stochastic bandits. Finally, we propose an asymptotically optimal algorithm for this setting, which is shown to achieve competitive empirical performance against existing modified best-arm identification algorithms.

Domaines

Machine Learning [stat.ML]

Rémy Degenne : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03830700

Soumis le : mercredi 26 octobre 2022-14:55:11

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-03830700 , version 1 (26-10-2022)

Identifiants

HAL Id : hal-03830700 , version 1
ARXIV : 2206.04456

Citer

Rémy Degenne, Marc Jourdan. Choosing Answers in epsilon-Best-Answer Identification for Linear Bandits. 39th International Conference on Machine Learning (ICML 2022), Jul 2022, Baltimore, United States. ⟨hal-03830700⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-LILLE CRISTAL-SCOOL

20 Consultations

0 Téléchargements

Choosing Answers in epsilon-Best-Answer Identification for Linear Bandits

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager