Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance

Konstantin Avrachenkov; Laura Cottatellucci; Lorenzo Maggi

doi:10.1007/s00182-012-0343-9

Journal article, International Journal of Game Theory, Year: 2013

Konstantin Avrachenkov

Role: Author
PersonId: 11963
IdHAL: konstantin-avrachenkov
ORCID: 0000-0002-8124-8272
IdRef: 087245280

Models for the performance analysis and the control of networks

Laura Cottatellucci

Role: Author
PersonId: 1141113

Eurecom [Sophia Antipolis]

Lorenzo Maggi

Role: Author

Eurecom [Sophia Antipolis]

Abstract

We study multi-agent Markov decision processes (MDPs) in which cooperation among players is allowed. We find a cooperative payoff distribution procedure (MDP-CPDP) that distributes, in the course of the game, the payoff that players would earn in the long-run game. We show under which conditions such an MDP-CPDP fulfills a time consistency property, satisfies greedy players, and strengthens the coalition cohesiveness throughout the game. Finally, we refine the concept of Core for Cooperative MDPs.

Domains

Networks and telecommunications [cs.NI]

Contributor: Konstantin Avrachenkov

https://inria.hal.science/hal-00926471

Submitted on: Thursday, January 9, 2014, 16:01:32

Last modified on: Wednesday, March 15, 2023, 08:58:09

Dates and versions

hal-00926471, version 1 (09-01-2014)

Identifiers

HAL Id: hal-00926471, version 1
DOI: 10.1007/s00182-012-0343-9

Cite

Konstantin Avrachenkov, Laura Cottatellucci, Lorenzo Maggi. Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance. International Journal of Game Theory, 2013, 42 (1), pp.239-262. ⟨10.1007/s00182-012-0343-9⟩. ⟨hal-00926471⟩

Collections

INRIA EURECOM INRIA2
