Monte Carlo Information-Oriented Planning

Vincent Thomas; Gérémy Hutin; Olivier Buffet

Communication Dans Un Congrès Année : 2020

Monte Carlo Information-Oriented Planning

Planification Monte Carlo orientée information.

(1) , (2) , (1)

1
2

Vincent Thomas

Fonction : Auteur
PersonId : 16368
IdHAL : vincent-thomas
ORCID : 0000-0003-3401-4649

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Gérémy Hutin

Fonction : Auteur

École normale supérieure de Lyon

Olivier Buffet

Fonction : Auteur
PersonId : 1407
IdHAL : olivier-buffet
ORCID : 0000-0002-5072-5857

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Résumé

In this article, we discuss how to solve information-gathering problems expressed as ρ-POMDPs, an extension of Partially Observable Markov Decision Processes (POMDPs) whose reward ρ depends on the belief state. Point-based approaches used for solving POMDPs have been extended to solving ρ-POMDPs as belief MDPs when its reward ρ is convex in B or when it is Lipschitz-continuous. In the present paper, we build on the POMCP algorithm to propose a Monte Carlo Tree Search for ρ-POMDPs, aiming for an efficient on-line planner which can be used for any ρ function. Adaptations are required due to the belief-dependent rewards to (i) propagate more than one state at a time, and (ii) prevent biases in value estimates. An asymptotic convergence proof to-optimal values is given when ρ is continuous. Experiments are conducted to analyze the algorithms at hand and show that they outperform myopic approaches.

Domaines

Systèmes et contrôle [cs.SY] Intelligence artificielle [cs.AI]

Fichier principal

ecai_2020.pdf (354.33 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Vincent Thomas : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02943028

Soumis le : vendredi 18 septembre 2020-15:09:49

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : vendredi 4 décembre 2020-22:25:16

Dates et versions

hal-02943028 , version 1 (18-09-2020)

Identifiants

HAL Id : hal-02943028 , version 1

Citer

Vincent Thomas, Gérémy Hutin, Olivier Buffet. Monte Carlo Information-Oriented Planning. 24th ECAI 2020 - European Conference on Artificial Intelligence, Aug 2020, Santiago de Compostela, Spain. ⟨hal-02943028⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-LYON CNRS INRIA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-AIS UDL SILECS

75 Consultations

175 Téléchargements

Monte Carlo Information-Oriented Planning

Planification Monte Carlo orientée information.

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager