Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs

Eugenio Della Vecchia; Silvia C. Di Marco; Alain Jean-Marie

doi:10.1007/s10479-012-1070-0

Article Dans Une Revue Annals of Operations Research Année : 2012

Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs

(1) , (1) , (2, 3)

1
2
3

Eugenio Della Vecchia

Fonction : Auteur

Facultad de Ciencias Exactas, Ingenieria y Agrimensura [Rosario]

Silvia C. Di Marco

Fonction : Auteur

Facultad de Ciencias Exactas, Ingenieria y Agrimensura [Rosario]

Alain Jean-Marie

Fonction : Auteur
PersonId : 1711
IdHAL : alain-jean-marie
ORCID : 0000-0002-9210-4530
IdRef : 059937386

Models for the performance analysis and the control of networks

Hors Équipe

Résumé

This paper is concerned with the links between the Value Iteration algorithm and the Rolling Horizon procedure, for solving problems of stochastic optimal control under the long-run average criterion, in Markov Decision Processes with finite state and action spaces. We review conditions of the literature which imply the geometric convergence of Value It- eration to the optimal value. Aperiodicity is an essential prerequisite for convergence. We prove that the convergence of Value Iteration generally implies that of Rolling Horizon. We also present a modified Rolling Horizon procedure that can be applied to models without analyzing periodicity, and discuss the impact of this transformation on convergence. We il- lustrate with numerous examples the different results. Finally, we discuss rules for stopping Value Iteration or finding the length of a Rolling Horizon. We provide an example which demonstrates the difficulty of the question, disproving in particular a conjectured rule pro- posed by Puterman.

Domaines

Recherche opérationnelle [math.OC]

Alain Jean-Marie : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00862915

Soumis le : mardi 17 septembre 2013-17:45:49

Dernière modification le : vendredi 24 mars 2023-14:52:57

Dates et versions

hal-00862915 , version 1 (17-09-2013)

Identifiants

HAL Id : hal-00862915 , version 1
DOI : 10.1007/s10479-012-1070-0

Citer

Eugenio Della Vecchia, Silvia C. Di Marco, Alain Jean-Marie. Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs. Annals of Operations Research, 2012, 199 (1), pp.193-214. ⟨10.1007/s10479-012-1070-0⟩. ⟨hal-00862915⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA LIRMM INRIA2 HORSEQUIPE MIPS UNIV-MONTPELLIER

130 Consultations

0 Téléchargements

Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager