Constrained Markov Decision Processes with Total Expected Cost Criteria - Archive ouverte HAL Access content directly
Conference Papers Year :

Constrained Markov Decision Processes with Total Expected Cost Criteria

(1, 2, 3) , (2, 1) , (4)
1
2
3
4

Abstract

We study in this paper a multiobjective dynamic programm-ming where all the criteria are in the form of total expected sum of costs till absorption in some set of states M. We assume that instantaneous costs are strictly positive and make no assumption on the ergodic structure of the Markov Decision Process. Our main result is to extend the linear program solution approach that was previously derived for transient CMDPs (Constrained Markov Decision Processes) to general ergodic structure. Several (additive) cost met-rics are defined and (possibly randomized) routing policies are sought which minimize one of the costs subject to constraints over the other objectives.
Fichier principal
Vignette du fichier
cr1.pdf (193.14 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02053360 , version 1 (01-03-2019)

Identifiers

Cite

Eitan Altman, Said Boularouk, Didier Josselin. Constrained Markov Decision Processes with Total Expected Cost Criteria. VALUETOOLS 2019 - 12th EAI International Conference on Performance Evaluation Methodologies and Tools, Mar 2019, Palma, Spain. pp.191-192, ⟨10.1145/3306309.3306342⟩. ⟨hal-02053360⟩
165 View
172 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More