Multi-agent online learning in time-varying games

Benoît Duvocelle; Panayotis Mertikopoulos; Mathias Staudigl; Dries Vermeulen

doi:10.1287/moor.2022.1283

Article Dans Une Revue Mathematics of Operations Research Année : 2023

Multi-agent online learning in time-varying games

(1) , (2, 3) , (1) , (1)

1
2
3

Benoît Duvocelle

Fonction : Auteur

Maastricht University [Maastricht]

Panayotis Mertikopoulos

Fonction : Auteur
PersonId : 1933
IdHAL : mertikop
ORCID : 0000-0003-2026-9616
IdRef : 253119758

Performance analysis and optimization of LARge Infrastructures and Systems

Criteo AI Lab

Mathias Staudigl

Fonction : Auteur

Maastricht University [Maastricht]

Dries Vermeulen

Fonction : Auteur

Maastricht University [Maastricht]

Résumé

We examine the long-run behavior of multi-agent online learning in games that evolve over time. Specifically, we focus on a wide class of policies based on mirror descent, and we show that the induced sequence of play (a) converges to Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit; and (b) it stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone). Our results apply to both gradient-based and payoff-based feedback - i.e., when players only get to observe the payoffs of their chosen actions.

Domaines

Optimisation et contrôle [math.OC] Autres [stat.ML]

Fichier principal

Main.pdf (737.42 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Panayotis Mertikopoulos : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01891545

Soumis le : jeudi 21 décembre 2023-11:37:54

Dernière modification le : vendredi 5 avril 2024-03:10:01

Dates et versions

hal-01891545 , version 1 (21-12-2023)

Licence

Paternité

Identifiants

HAL Id : hal-01891545 , version 1
ARXIV : 1809.03066
DOI : 10.1287/moor.2022.1283

Citer

Benoît Duvocelle, Panayotis Mertikopoulos, Mathias Staudigl, Dries Vermeulen. Multi-agent online learning in time-varying games. Mathematics of Operations Research, 2023, 48 (2), pp.914-941. ⟨10.1287/moor.2022.1283⟩. ⟨hal-01891545⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG LIG_SRCPR INRIA2 TDS-MACS LIG-SRCPR-POLARIS MIAI ANR LIG_SIDCH

153 Consultations

6 Téléchargements

Multi-agent online learning in time-varying games

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager