Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Operations Research Letters Année : 2014

Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming

Résumé

This note shows that the number of arithmetic operations required by any member of a broad class of optimistic policy iteration algorithms to solve a deterministic discounted dynamic programming problem with three states and four actions may grow arbitrarily. Therefore any such algorithm is not strongly polynomial. In particular, the modified policy iteration and $\lambda$-policy iteration algorithms are not strongly polynomial.
Fichier principal
Vignette du fichier
Feinberg_Huang_Scherrer.pdf (106.54 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01091370 , version 1 (05-12-2014)

Identifiants

Citer

Eugene A. Feinberg, Jefferson Huang, Bruno Scherrer. Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming. Operations Research Letters, 2014, 42, pp.429 - 431. ⟨10.1016/j.orl.2014.07.006⟩. ⟨hal-01091370⟩
404 Consultations
433 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More