Qualitative Multi-Armed Bandits: A Quantile-Based Approach

Balazs Szorenyi; Róbert Busa-Fekete; Paul Weng; Eyke Hüllermeier

Conference paper, Year: 2015



Balazs Szorenyi

Role: Author
PersonId: 961175

Department of Electrical Engineering - Technion [Haïfa]

MTA-SZTE Research Group on Artificial Intelligence

Sequential Learning

Róbert Busa-Fekete

Role: Author

University of Paderborn

Paul Weng

Role: Author

SYSU-CMU Joint Institute of Engineering

SYSU-CMU Shunde International Joint Research Institute

Eyke Hüllermeier

Role: Author

University of Paderborn

Abstract

We formalize and study the multi-armed bandit (MAB) problem in a generalized stochastic setting in which rewards are not assumed to be numerical. Instead, rewards are measured on a qualitative scale that allows for comparison but invalidates arithmetic operations such as averaging. Correspondingly, instead of characterizing an arm by the mean of its underlying distribution, we use a quantile of that distribution as a representative value. We address the problem of quantile-based online learning for both a finite time horizon (pure exploration) and an infinite time horizon (cumulative regret minimization). For both cases, we propose suitable algorithms and analyze their properties. These properties are also illustrated by initial experimental studies.
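To make the idea of a quantile as an arm's representative value concrete, here is a minimal sketch, not the authors' algorithm. It assumes one common definition of the tau-quantile (the smallest level whose cumulative frequency reaches tau) and simply compares arms by the empirical quantile of samples already collected. The function names, the four-level scale, and the sample data are all hypothetical and chosen for illustration only.

```python
from collections import Counter
from typing import Dict, Sequence


def empirical_quantile(samples: Sequence[str], scale: Sequence[str], tau: float) -> str:
    """Return the smallest level on `scale` whose empirical CDF is >= tau.

    `scale` lists the qualitative levels from worst to best. Only the order
    of the levels is used; no arithmetic is performed on the rewards.
    """
    if not samples:
        raise ValueError("samples must be non-empty")
    if not 0.0 < tau <= 1.0:
        raise ValueError("tau must lie in (0, 1]")
    if set(samples) - set(scale):
        raise ValueError("samples contain levels that are not on the scale")

    counts = Counter(samples)
    n = len(samples)
    cumulative = 0
    for level in scale:  # walk the scale from worst to best
        cumulative += counts[level]
        if cumulative >= tau * n:
            return level
    return scale[-1]  # only reachable through floating-point rounding


def best_arm_by_quantile(arms: Dict[str, Sequence[str]], scale: Sequence[str], tau: float) -> str:
    """Pick the arm whose empirical tau-quantile ranks highest on the scale."""
    rank = {level: i for i, level in enumerate(scale)}
    return max(arms, key=lambda name: rank[empirical_quantile(arms[name], scale, tau)])


# Hypothetical ordinal scale and observed rewards for two arms.
SCALE = ["poor", "fair", "good", "excellent"]
ARMS = {
    "a": ["poor", "good", "good", "excellent", "good"],
    "b": ["fair", "fair", "excellent", "excellent", "fair"],
}

if __name__ == "__main__":
    for name, rewards in ARMS.items():
        print(name, empirical_quantile(rewards, SCALE, 0.5))
    print("best:", best_arm_by_quantile(ARMS, SCALE, 0.5))
```

With tau = 0.5 the sketch compares arms by their empirical median. Arm "b" has more "excellent" outcomes than arm "a", yet its median is lower, which shows why the choice of quantile matters when averaging is unavailable. An online algorithm of the kind the abstract describes would additionally need confidence bounds on these estimates to decide which arm to sample next; that part is deliberately left out here.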

Domains

Machine Learning [cs.LG]; Statistics [math.ST]

Main file

qmab_final.pdf (484.63 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-01204708

Submitted on: Thursday, September 24, 2015, 14:36:36

Last modified on: Wednesday, April 17, 2024, 14:39:17

Long-term archiving on: Tuesday, December 29, 2015, 09:49:32

Dates and versions

hal-01204708, version 1 (24-09-2015)

Identifiers

HAL Id: hal-01204708, version 1

Cite

Balazs Szorenyi, Róbert Busa-Fekete, Paul Weng, Eyke Hüllermeier. Qualitative Multi-Armed Bandits: A Quantile-Based Approach. 32nd International Conference on Machine Learning, Jul 2015, Lille, France. pp.1660-1668. ⟨hal-01204708⟩


Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

871 views

453 downloads
