Une approche modifiée de Lambda-Policy Iteration

Christophe Thiery; Bruno Scherrer

Conference Papers Year : 2009

Une approche modifiée de Lambda-Policy Iteration

(1) , (1)

Christophe Thiery

Function : Author
PersonId : 842769

Autonomous intelligent machine

Bruno Scherrer

Function : Author
PersonId : 1406
IdHAL : bruno-scherrer
IdRef : 073360708

Autonomous intelligent machine

Abstract

Dans le cadre du contrôle optimal stochastique, nous proposons une manière modifiée de mettre en oeuvre l'algorithme λ-Policy Iteration (Bertsekas & Tsitsiklis, 1996), une méthode qui généralise Value Iteration et Policy Iteration en introduisant un paramètre λ. Nous montrons que cette version modifiée, qui est analogue à Modified Policy Iteration, généralise tous ces algorithmes et converge vers la fonction de valeur optimale. En nous appuyant sur des arguments analytiques et expérimentaux, nous mettons en évidence le fait que lorsque l'algorithme est appliqué de manière exacte, le paramètre λ ne permet pas d'améliorer la vitesse de convergence de manière significative.

Keywords

Contrôle optimal stochastique Apprentissage par renforcement Programmation dynamique Processus Décisionnels de Markov Modified λ-Policy Iteration

Domains

Artificial Intelligence [cs.AI]

Fichier principal

thiery-christophe.pdf (109.51 Ko)

Origin : Explicit agreement for this submission

Christophe Thiery : Connect in order to contact the contributor

https://inria.hal.science/inria-00418910

Submitted on : Tuesday, September 22, 2009-11:20:46 AM

Last modification on : Friday, March 24, 2023-2:52:52 PM

Long-term archiving on: Thursday, June 30, 2011-11:48:52 AM

Dates and versions

inria-00418910 , version 1 (22-09-2009)

Identifiers

HAL Id : inria-00418910 , version 1

Cite

Christophe Thiery, Bruno Scherrer. Une approche modifiée de Lambda-Policy Iteration. Journées Francophones Planification Décision Apprentissage, UPMC-Paris 6, Jun 2009, Paris, France. ⟨inria-00418910⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

174 View

192 Download

Une approche modifiée de Lambda-Policy Iteration

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share