Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP

Yang You; Vincent Thomas; Francis Colas; Olivier Buffet

Preprint, Working Paper. Year: 2021

Yang You

Role: Author
PersonId : 1115716

Vincent Thomas

Role: Author
PersonId : 16368
IdHAL : vincent-thomas
ORCID : 0000-0003-3401-4649

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Francis Colas

Role: Author
PersonId : 10037
IdHAL : francis-colas
ORCID : 0000-0002-7449-7676
IdRef : 112906206

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Olivier Buffet

Role: Author
PersonId : 1407
IdHAL : olivier-buffet
ORCID : 0000-0002-5072-5857

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Abstract

This paper addresses collaborative planning problems formalized as Decentralized POMDPs (Dec-POMDPs) by searching for Nash equilibria, i.e., situations where each agent's policy is a best response to the other agents' (fixed) policies. While the Joint Equilibrium-based Search for Policies (JESP) algorithm does this in the finite-horizon setting using policy trees, we propose adapting it to infinite-horizon Dec-POMDPs by using finite state controller (FSC) policy representations. In this article, we (1) explain how to turn a Dec-POMDP with $N-1$ fixed FSCs into an infinite-horizon POMDP whose solution is a best response for the $N^\text{th}$ agent; (2) propose a JESP variant, called Inf-JESP, that uses this construction to solve infinite-horizon Dec-POMDPs; (3) introduce heuristic initializations for JESP intended to lead to good solutions; and (4) conduct experiments on state-of-the-art benchmark problems to evaluate our approach.
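
To make item (1) of the abstract more concrete, here is a minimal sketch of how a two-agent Dec-POMDP with one fixed FSC might be folded into a single-agent model. It is an illustration under stated assumptions, not the construction from the paper: it assumes two agents, a deterministic FSC for agent 2, and an extended state `e = (s, n)` pairing the hidden state with agent 2's FSC node. Because agent 1's observation is correlated with agent 2's observation (which drives the node update), the sketch returns the joint dynamics `P[e, a1, e', o1]` rather than separate transition and observation functions. All names (`best_response_pomdp`, `fsc_action`, `fsc_next`, the array layouts) are hypothetical.

```python
import numpy as np

def best_response_pomdp(T, O, R, fsc_action, fsc_next):
    """Fold agent 2's fixed FSC into the model seen by agent 1.

    Assumed (hypothetical) array layouts:
      T[s, a1, a2, s2]      state transition probabilities
      O[a1, a2, s2, o1, o2] joint observation probabilities
      R[s, a1, a2]          shared reward
      fsc_action[n]         action agent 2 takes in node n
      fsc_next[n, o2]       agent 2's next node after observing o2
    Returns P[e, a1, e2, o1] and Rbr[e, a1], with e = s * n_nodes + n.
    """
    n_s, n_a1 = T.shape[0], T.shape[1]
    n_o1, n_o2 = O.shape[3], O.shape[4]
    n_nodes = len(fsc_action)
    n_e = n_s * n_nodes
    P = np.zeros((n_e, n_a1, n_e, n_o1))
    Rbr = np.zeros((n_e, n_a1))
    for s in range(n_s):
        for n in range(n_nodes):
            e = s * n_nodes + n
            a2 = fsc_action[n]  # agent 2's action is fixed by its node
            for a1 in range(n_a1):
                Rbr[e, a1] = R[s, a1, a2]
                for s2 in range(n_s):
                    for o2 in range(n_o2):
                        # agent 2's hidden observation moves its node
                        e2 = s2 * n_nodes + fsc_next[n, o2]
                        P[e, a1, e2, :] += T[s, a1, a2, s2] * O[a1, a2, s2, :, o2]
    return P, Rbr

# Tiny random instance (made-up numbers, for illustration only).
rng = np.random.default_rng(0)

def random_dist(shape):
    x = rng.random(shape)
    return x / x.sum(axis=-1, keepdims=True)

n_s, n_a, n_o, n_nodes = 2, 2, 2, 3
T = random_dist((n_s, n_a, n_a, n_s))
O = random_dist((n_a, n_a, n_s, n_o * n_o)).reshape(n_a, n_a, n_s, n_o, n_o)
R = rng.random((n_s, n_a, n_a))
fsc_action = np.array([0, 1, 0])
fsc_next = np.array([[1, 2], [0, 2], [2, 0]])

P, Rbr = best_response_pomdp(T, O, R, fsc_action, fsc_next)
```

In a JESP-style loop, a model of this kind would be rebuilt for each agent in turn, handed to an infinite-horizon POMDP solver, and the resulting policy converted back into an FSC before moving on to the next agent; those steps are outside the scope of this sketch.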

Domains

Artificial Intelligence [cs.AI]

https://inria.hal.science/hal-03523415

Submitted on: Wednesday, January 12, 2022, 16:26:13

Last modified on: Wednesday, November 29, 2023, 16:19:56

Dates and versions

hal-03523415 , version 1 (12-01-2022)

Identifiers

HAL Id : hal-03523415 , version 1
ARXIV : 2109.08755

Cite

Yang You, Vincent Thomas, Francis Colas, Olivier Buffet. Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP. 2021. ⟨hal-03523415⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-AIS ANR
