Using linear programming duality for solving finite horizon Dec-POMDPs

Raghav Aras; Alain Dutech; François Charpillet

Report (Technical Report) Year: 2008

Raghav Aras

Role: Author
PersonId: 830439

Autonomous intelligent machine

Alain Dutech

Role: Author
PersonId: 1580
IdHAL: alain-dutech
ORCID: 0000-0001-7549-7988
IdRef: 131102532

Autonomous intelligent machine

François Charpillet

Role: Author
PersonId: 1910
IdHAL: francois-charpillet
ORCID: 0000-0001-8260-1536
IdRef: 070140553

Autonomous intelligent machine

Abstract

This paper studies the problem of finding an optimal finite-horizon joint policy for a decentralized partially observable Markov decision process (Dec-POMDP). We present a new algorithm for finding such a policy. The algorithm rests on the fact that a necessary condition for a joint policy to be optimal is that it be locally optimal (that is, a Nash equilibrium). Through linear programming duality, this necessary condition can be transformed into a nonlinear program, which can in turn be transformed into a 0-1 mixed integer linear program (MILP) whose optimal solution is an optimal joint policy (in the sequence form). The proposed algorithm thus consists of solving this 0-1 MILP. Computational experience with the 0-1 MILP on two- and three-agent Dec-POMDPs gives mixed results: on some problems it is faster than existing algorithms; on others it is slower.

Keywords

Dec-POMDPs; decentralized problems

Domains

Multi-agent systems [cs.MA]; Computer science and game theory [cs.GT]; Operations research [math.OC]

Main file

RR-6641.pdf (185.79 KB)

Origin: Files produced by the author(s)

Contributor: Raghav Aras

https://inria.hal.science/inria-00320645

Submitted on: Tuesday, September 16, 2008, 14:49:56

Last modified on: Thursday, February 15, 2024, 03:31:44

Long-term archiving on: Tuesday, June 28, 2011, 16:35:05

Dates and versions

inria-00320645, version 1 (16-09-2008)

Identifiers

HAL Id: inria-00320645, version 1

Cite

Raghav Aras, Alain Dutech, François Charpillet. Using linear programming duality for solving finite horizon Dec-POMDPs. [Technical Report] RR-6641, INRIA. 2008, pp.27. ⟨inria-00320645⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA INRIA-RRRT UNIV-LORRAINE INRIA2 LORIA LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

154 views

138 downloads
