Solving multichain stochastic games with mean payoff by policy iteration

Marianne Akian; Jean Cochet-Terrasson; Sylvie Detournay; Stéphane Gaubert

doi:10.1109/CDC.2013.6760149

Communication Dans Un Congrès Année : 2013

Solving multichain stochastic games with mean payoff by policy iteration

(1, 2) , (3) , (1, 2) , (1, 2)

1
2
3

Marianne Akian

Fonction : Auteur
PersonId : 830429

Max-plus algebras and mathematics of decision

Centre de Mathématiques Appliquées - Ecole Polytechnique

Jean Cochet-Terrasson

Fonction : Auteur
PersonId : 935262

Contrôle général des armées

Sylvie Detournay

Fonction : Auteur
PersonId : 935260

Max-plus algebras and mathematics of decision

Centre de Mathématiques Appliquées - Ecole Polytechnique

Stéphane Gaubert

Fonction : Auteur
PersonId : 1887
IdHAL : stephane-gaubert
IdRef : 104895306

Max-plus algebras and mathematics of decision

Centre de Mathématiques Appliquées - Ecole Polytechnique

Résumé

Zero-sum stochastic games with finite state and action spaces, perfect information, and mean payoff criteria arise in particular from the monotone discretization of mean-payoff pursuit-evasion deterministic differential games. In that case no irreducibility assumption on the Markov chains associated to strategies are satisfied (multichain games). The value of such a game can be characterized by a system of nonlinear equations, involving the mean payoff vector and an auxiliary vector (relative value or bias). Cochet-Terrasson and Gaubert proposed in (C. R. Math. Acad. Sci. Paris, 2006) a policy iteration algorithm relying on a notion of nonlinear spectral projection (Akian and Gaubert, Nonlinear Analysis TMA, 2003), which allows one to avoid cycling in degenerate iterations. We give here a complete presentation of the algorithm, with details of implementation in particular of the nonlinear projection. This has led to the software PIGAMES and allowed us to present numerical results on pursuit-evasion games.

Domaines

Optimisation et contrôle [math.OC]

Marianne Akian : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00933689

Soumis le : lundi 20 janvier 2014-21:41:54

Dernière modification le : mercredi 17 avril 2024-13:46:26

Dates et versions

hal-00933689 , version 1 (20-01-2014)

Identifiants

HAL Id : hal-00933689 , version 1
DOI : 10.1109/CDC.2013.6760149

Citer

Marianne Akian, Jean Cochet-Terrasson, Sylvie Detournay, Stéphane Gaubert. Solving multichain stochastic games with mean payoff by policy iteration. CDC 2013 - 52nd IEEE Conference on Decision and Control, Dec 2013, Florence, Italy. pp.1834-1841, ⟨10.1109/CDC.2013.6760149⟩. ⟨hal-00933689⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X CNRS INRIA X-CMAP X-DEP-MATHA CMAP INRIA2 TDS-MACS

2613 Consultations

0 Téléchargements

Solving multichain stochastic games with mean payoff by policy iteration

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager