Constrained Markov Decision Processes with Total Expected Cost Criteria

Abstract : We study in this paper a multiobjective dynamic programm-ming where all the criteria are in the form of total expected sum of costs till absorption in some set of states M. We assume that instantaneous costs are strictly positive and make no assumption on the ergodic structure of the Markov Decision Process. Our main result is to extend the linear program solution approach that was previously derived for transient CMDPs (Constrained Markov Decision Processes) to general ergodic structure. Several (additive) cost met-rics are defined and (possibly randomized) routing policies are sought which minimize one of the costs subject to constraints over the other objectives.
Complete list of metadatas

Cited literature [3 references]  Display  Hide  Download

https://hal.inria.fr/hal-02053360
Contributor : Eitan Altman <>
Submitted on : Friday, March 1, 2019 - 11:42:55 AM
Last modification on : Thursday, October 17, 2019 - 12:36:59 PM
Long-term archiving on : Thursday, May 30, 2019 - 1:59:42 PM

File

cr1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02053360, version 1

Citation

Eitan Altman, Said Boularouk, Didier Josselin. Constrained Markov Decision Processes with Total Expected Cost Criteria. VALUETOOLS 2019 - 12th EAI International Conference on Performance Evaluation Methodologies and Tools, Mar 2019, Palma, Spain. pp.191-192. ⟨hal-02053360⟩

Share

Metrics

Record views

152

Files downloads

141