Exploiting structure of uncertainty for efficient matroid semi-bandits

Pierre Perrault; Vianney Perchet; Michal Valko

Conference paper, Year: 2019

Pierre Perrault

Role: Author

Sequential Learning

Vianney Perchet

Role: Author
PersonId: 871940

Centre de Recherche en Économie et Statistique

Michal Valko

Role: Author
PersonId: 284
IdHAL: michal
IdRef: 22360934X

Sequential Learning

DeepMind [Paris]

Abstract

We improve the efficiency of algorithms for stochastic combinatorial semi-bandits. In most interesting problems, state-of-the-art algorithms take advantage of structural properties of rewards, such as independence. However, while optimal in terms of asymptotic regret, these algorithms are computationally inefficient. In our paper, we first reduce their implementation to a specific submodular maximization problem. Then, in the case of matroid constraints, we design adapted approximation routines, thereby providing the first efficient algorithms that rely on reward structure to improve the regret bound. In particular, we improve the state-of-the-art efficient gap-free regret bound by a factor of √m / log m, where m is the maximum action size. Finally, we show how our improvement translates to more general budgeted combinatorial semi-bandits.
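To give a feel for what "submodular maximization under a matroid constraint" looks like in code, here is a minimal sketch of the classical greedy heuristic. It is not the approximation routine designed in the paper. The function name `greedy_matroid`, the toy objective (a sum of means plus the square root of a sum of widths, loosely resembling an optimistic index), and all numeric values are assumptions made up for illustration.

```python
import math
from typing import Callable, Iterable, Set


def greedy_matroid(
    ground: Iterable[int],
    is_independent: Callable[[Set[int]], bool],
    objective: Callable[[Set[int]], float],
) -> Set[int]:
    """Greedily build an independent set, adding the best feasible element each round."""
    chosen: Set[int] = set()
    remaining = set(ground)
    while remaining:
        base = objective(chosen)
        best, best_gain = None, 0.0
        for e in sorted(remaining):  # sorted only to make tie-breaking deterministic
            candidate = chosen | {e}
            if not is_independent(candidate):
                continue  # adding e would violate the matroid constraint
            gain = objective(candidate) - base
            if gain > best_gain:
                best, best_gain = e, gain
        if best is None:  # no feasible element has a positive marginal gain
            break
        chosen.add(best)
        remaining.discard(best)
    return chosen


# Hypothetical toy instance: 4 arms, uniform matroid of rank m = 2.
means = [0.9, 0.5, 0.4, 0.1]      # illustrative empirical means
widths = [0.01, 0.04, 0.25, 0.0]  # illustrative squared confidence widths
m = 2


def is_independent(s: Set[int]) -> bool:
    # Uniform matroid: any set of at most m elements is independent.
    return len(s) <= m


def objective(s: Set[int]) -> float:
    # Modular term plus a concave function of a modular term (monotone submodular here).
    return sum(means[i] for i in s) + math.sqrt(sum(widths[i] for i in s))


selected = greedy_matroid(range(4), is_independent, objective)
print(selected)
```

On this toy instance the uncertainty bonus changes the outcome: arm 2 is preferred over arm 1 despite its lower mean, because its larger width contributes more to the square-root term.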

Domains

Machine Learning [stat.ML]

Main file

supplementary.pdf (882.58 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-02387478

Submitted on: Friday, 29 November 2019, 18:11:03

Last modified on: Wednesday, 24 January 2024, 09:54:23

Dates and versions

hal-02387478, version 1 (29-11-2019)

Identifiers

HAL Id: hal-02387478, version 1

Cite

Pierre Perrault, Vianney Perchet, Michal Valko. Exploiting structure of uncertainty for efficient matroid semi-bandits. International Conference on Machine Learning, 2019, Long Beach, United States. ⟨hal-02387478⟩

Collections

X GENES CNRS INRIA ENSAE PARISTECH CREST ENSAI CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-PARIS-SACLAY UNIV-LILLE X-CREST

138 views

116 downloads
