Conference paper, Year: 2005

Cooperation in stochastic games through communication

Raghav Aras
Alain Dutech
François Charpillet

Abstract

We describe a reinforcement learning process for two-agent general-sum stochastic games under imperfect observability of moves and payoffs. It is known that, in practice, agents using naive Q-learning can learn equilibrium policies under the discounted reward criterion, although in the absence of global optima these equilibria may be arbitrarily worse for both agents than some non-equilibrium policy. We instead aim for Pareto-efficient policies, in which agents enjoy higher payoffs than in an equilibrium, and show that agents can achieve this using naive Q-learning augmented with communication and a payoff interpretation rule. In essence, our objective is to shift the focus of learning from equilibria (to which solipsistic algorithms converge) to non-equilibria, by transforming the latter into equilibria.
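The abstract only names the ingredients (naive Q-learning, a communication step, and a payoff interpretation rule), so the Python sketch below shows one way such a learner could be wired together. The environment interface (env.reset/env.step), the min-based interpretation rule, and all hyperparameters are illustrative assumptions, not the construction used in the paper.

    import random
    from collections import defaultdict

    class QLearner:
        """Naive (solipsistic) Q-learner: each agent keeps its own table."""
        def __init__(self, n_actions, alpha=0.1, gamma=0.95, epsilon=0.1):
            self.q = defaultdict(float)          # Q-values keyed by (state, action)
            self.n_actions = n_actions
            self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

        def act(self, state):
            # Epsilon-greedy over this agent's own Q-values.
            if random.random() < self.epsilon:
                return random.randrange(self.n_actions)
            return max(range(self.n_actions), key=lambda a: self.q[(state, a)])

        def update(self, state, action, interpreted_payoff, next_state):
            # Standard Q-learning backup, but on the *interpreted* payoff.
            best_next = max(self.q[(next_state, a)] for a in range(self.n_actions))
            target = interpreted_payoff + self.gamma * best_next
            self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])

    def interpret(own_payoff, communicated_payoff):
        # Placeholder interpretation rule (an assumption, not the paper's rule):
        # value a joint move by the lower of the two communicated payoffs,
        # which biases learning toward outcomes that are good for both agents.
        return min(own_payoff, communicated_payoff)

    def run_episode(env, agents, horizon=100):
        # env is a hypothetical two-agent stochastic game returning one payoff per agent.
        state = env.reset()
        for _ in range(horizon):
            actions = tuple(ag.act(state) for ag in agents)
            next_state, payoffs = env.step(actions)
            # Communication phase: each agent reveals its private payoff to the other,
            # and both update on the interpreted payoff rather than the raw one.
            for i, ag in enumerate(agents):
                ag.update(state, actions[i],
                          interpret(payoffs[i], payoffs[1 - i]), next_state)
            state = next_state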
Main file
p525-aras.pdf (58.6 KB)

Dates and versions

inria-00000208, version 1 (13-09-2005)

Identifiers

  • HAL Id: inria-00000208
  • DOI: 10.1145/1082473.1082691

Cite

Raghav Aras, Alain Dutech, François Charpillet. Cooperation in stochastic games through communication. 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS'05), Jul 2005, Utrecht, The Netherlands, pp. 1197-1198, ⟨10.1145/1082473.1082691⟩. ⟨inria-00000208⟩
