A Partially-Observable Markov Decision Process for Dealing with Dynamically Changing Environments

Sotirios P. Chatzis; Dimitrios Kosmopoulos

doi:10.1007/978-3-662-44654-6_11

Conference paper, Year: 2014


Sotirios P. Chatzis

Role: Author

Cyprus University of Technology

Dimitrios Kosmopoulos

Role: Author

TEI Crete

Abstract

Partially Observable Markov Decision Processes (POMDPs) have been met with great success in planning domains where agents must balance actions that provide knowledge and actions that provide reward. Recently, nonparametric Bayesian methods have been successfully applied to POMDPs to obviate the need for a priori knowledge of the size of the state space, allowing one to assume that the number of visited states may grow as the agent explores its environment. These approaches rely on the assumption that the agent's environment remains stationary; however, in real-world scenarios the environment may change over time. In this work, we aim to address this inadequacy by introducing a dynamic nonparametric Bayesian POMDP model that allows both for automatic inference of the (distributional) representations of POMDP states and for capturing non-stationarity in the modeled environments. The formulation of our method is based on imposing a suitable dynamic hierarchical Dirichlet process (dHDP) prior over state transitions. We derive efficient algorithms for model inference and action planning and evaluate them on several benchmark tasks.

Domains

Computer Science [cs]

Main file

978-3-662-44654-6_11_Chapter.pdf (204.15 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01391299

Submitted on: Thursday, November 3, 2016, 10:52:13

Last modified on: Thursday, March 5, 2020, 17:41:13

Long-term archiving on: Saturday, February 4, 2017, 12:54:49

Dates and versions

hal-01391299 , version 1 (03-11-2016)

License

Attribution

Identifiers

HAL Id: hal-01391299, version 1
DOI: 10.1007/978-3-662-44654-6_11

Cite

Sotirios P. Chatzis, Dimitrios Kosmopoulos. A Partially-Observable Markov Decision Process for Dealing with Dynamically Changing Environments. 10th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), Sep 2014, Rhodes, Greece. pp.111-120, ⟨10.1007/978-3-662-44654-6_11⟩. ⟨hal-01391299⟩

Collections

IFIP IFIP-AICT IFIP-TC IFIP-WG IFIP-TC12 IFIP-AIAI IFIP-WG12-5 IFIP-AICT-436

127 views

264 downloads
