Hedging Algorithms and Repeated Matrix Games

Bruno Bouzy; Marc Métivier; Damien Pellier

Communication Dans Un Congrès Année : 2011

Hedging Algorithms and Repeated Matrix Games

(1) , (1) , (1)

Bruno Bouzy

Fonction : Auteur

Laboratoire d'Informatique Paris Descartes

Marc Métivier

Fonction : Auteur

Laboratoire d'Informatique Paris Descartes

Damien Pellier

Fonction : Auteur
PersonId : 4378
IdHAL : pellier
ORCID : 0000-0003-3791-8985
IdRef : 10462213X

Laboratoire d'Informatique Paris Descartes

Résumé

Playing repeated matrix games (RMG) while maximizing the cumulative returns is a basic method to evaluate multi-agent learning (MAL) algorithms. Previous work has shown that UCB, M3, S or Exp3 algorithms have good behaviors on average in RMG. Besides, hedging algorithms have been shown to be effective on prediction problems. An hedging algorithm is made up with a top-level algorithm and a set of basic algorithms. To make its decision, an hedging algorithm uses its top-level algorithm to choose a basic algorithm, and the chosen algorithm makes the decision. This paper experimentally shows that well-selected hedging algorithms are better on average than all previous MAL algorithms on the task of playing RMG against various players. S is a very good top-level algorithm, and UCB and M3 are very good basic algorithms. Furthermore, two-level hedging algorithms are more effective than one-level hedging algorithms, and three levels are not better than two levels.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

bouzy11.pdf (134.66 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Damien Pellier : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00975943

Soumis le : mercredi 9 avril 2014-13:29:22

Dernière modification le : samedi 25 juin 2022-20:54:22

Archivage à long terme le : mercredi 9 juillet 2014-12:11:11

Dates et versions

hal-00975943 , version 1 (09-04-2014)

Identifiants

HAL Id : hal-00975943 , version 1

Citer

Bruno Bouzy, Marc Métivier, Damien Pellier. Hedging Algorithms and Repeated Matrix Games. Workshop on Machine Learning and Data Mining in and around Games (ECML-PKDD), Sep 2011, Athens, Greece. ⟨hal-00975943⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

LIPADE UP-SCIENCES

151 Consultations

128 Téléchargements

Hedging Algorithms and Repeated Matrix Games

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager