Adaptive learning in continuous games: Optimal regret bounds and convergence to Nash equilibrium

Yu-Guan Hsieh; Kimon Antonakopoulos; Panayotis Mertikopoulos

Communication Dans Un Congrès Année : 2021

Adaptive learning in continuous games: Optimal regret bounds and convergence to Nash equilibrium

(1) , (2) , (2, 3)

1
2
3

Yu-Guan Hsieh

Fonction : Auteur
PersonId : 1081479

Données, Apprentissage et Optimisation

Kimon Antonakopoulos

Fonction : Auteur
PersonId : 1060439

Performance analysis and optimization of LARge Infrastructures and Systems

Panayotis Mertikopoulos

Fonction : Auteur
PersonId : 1933
IdHAL : mertikop
ORCID : 0000-0003-2026-9616
IdRef : 253119758

Performance analysis and optimization of LARge Infrastructures and Systems

Criteo AI Lab

Résumé

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often measured by its regret. However, no-regret algorithms are not created equal in terms of game-theoretic guarantees: depending on how they are tuned, some of them may drive the system to an equilibrium, while others could produce cyclic, chaotic, or otherwise divergent trajectories. To account for this, we propose a range of no-regret policies based on optimistic mirror descent, with the following desirable properties: i) they do not require any prior tuning or knowledge of the game; ii) they all achieve O(√ T) regret against arbitrary, adversarial opponents; and iii) they converge to the best response against convergent opponents. Also, if employed by all players, then iv) they guarantee O(1) social regret; while v) the induced sequence of play converges to Nash equilibrium with O(1) individual regret in all variationally stable games (a class of games that includes all monotone and convex-concave zero-sum games).

Domaines

Optimisation et contrôle [math.OC]

Fichier principal

2021-COLT-AdaptiveLearning.pdf (2.2 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Panayotis Mertikopoulos : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03342410

Soumis le : lundi 13 septembre 2021-12:29:17

Dernière modification le : vendredi 5 avril 2024-03:09:50

Dates et versions

hal-03342410 , version 1 (13-09-2021)

Identifiants

HAL Id : hal-03342410 , version 1

Citer

Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos. Adaptive learning in continuous games: Optimal regret bounds and convergence to Nash equilibrium. COLT 2021 - the 34th Annual Conference on Learning Theory, Aug 2021, Boulder, United States. pp.1-34. ⟨hal-03342410⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG LJK LJK_PS LIG_SRCPR PERSYVAL-LAB INRIA2 TDS-MACS LJK-PS-DAO LIG-SRCPR-POLARIS MIAI ANR LIG_SIDCH

119 Consultations

144 Téléchargements

Adaptive learning in continuous games: Optimal regret bounds and convergence to Nash equilibrium

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager