Abstraction Pathologies In Markov Decision Processes

Manel Tagorti; Bruno Scherrer; Olivier Buffet; Joerg Hoffmann

Communication Dans Un Congrès Année : 2013

Abstraction Pathologies In Markov Decision Processes

(1) , (1) , (1) , (1, 2)

1
2

Manel Tagorti

Fonction : Auteur
PersonId : 948730

Autonomous intelligent machine

Bruno Scherrer

Fonction : Auteur
PersonId : 1406
IdHAL : bruno-scherrer
IdRef : 073360708

Autonomous intelligent machine

Olivier Buffet

Fonction : Auteur
PersonId : 1407
IdHAL : olivier-buffet
ORCID : 0000-0002-5072-5857

Autonomous intelligent machine

Joerg Hoffmann

Fonction : Auteur
PersonId : 932988

Autonomous intelligent machine

Saarland University [Saarbrücken]

Résumé

Abstraction is a common method to compute lower bounds in classical planning, imposing an equivalence relation on the state space and deriving the lower bound from the quotient system. It is a trivial and well-known fact that refined abstractions can only improve the lower bound. Thus, when we embarked on applying the same technique in the probabilistic setting, our firm belief was to find the same behavior there. We were wrong. Indeed, there are cases where every direct refinement step (splitting one equivalence class into two) yields strictly worse bounds. We give a comprehensive account of the issues involved, for two wide-spread methods to define and use abstract MDPs.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

jfpda13-a.pdf (129.57 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Olivier Buffet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00907295

Soumis le : jeudi 21 novembre 2013-09:52:11

Dernière modification le : jeudi 1 février 2024-10:05:16

Archivage à long terme le : samedi 22 février 2014-04:32:09

Dates et versions

hal-00907295 , version 1 (21-11-2013)

Identifiants

HAL Id : hal-00907295 , version 1

Citer

Manel Tagorti, Bruno Scherrer, Olivier Buffet, Joerg Hoffmann. Abstraction Pathologies In Markov Decision Processes. 8èmes Journées Francophones sur la Planification, la Décision et l'Apprentissage pour la conduite de systèmes, Jul 2013, Lille, France. ⟨hal-00907295⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LORIA-AIS UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

159 Consultations

110 Téléchargements

Abstraction Pathologies In Markov Decision Processes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager