Recursive Least-Squares Learning with Eligibility Traces

Bruno Scherrer; Matthieu Geist

Conference paper, Year: 2011


Bruno Scherrer

Role: Author
PersonId: 1406
IdHAL: bruno-scherrer
IdRef: 073360708

Autonomous intelligent machine

Matthieu Geist

Role: Author
PersonId: 6945
IdHAL: matthieu-geist

SUPELEC-Campus Metz

Abstract

In the framework of Markov Decision Processes, we consider the problem of learning a linear approximation of the value function of some fixed policy from one trajectory possibly generated by some other policy. We describe a systematic approach for adapting on-policy least-squares learning algorithms from the literature (LSTD, LSPE, FPKF and GPTD/KTD) to off-policy learning with eligibility traces. This leads to two known algorithms, LSTD($\lambda$)/LSPE($\lambda$), and suggests new extensions of FPKF and GPTD/KTD. We describe their recursive implementation, discuss their convergence properties, and illustrate their behavior experimentally. Overall, our study suggests that the state-of-the-art LSTD($\lambda$) remains the best least-squares algorithm.
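To make the setting concrete, the following is a minimal sketch of an off-policy LSTD($\lambda$) estimate computed from a single trajectory. It is not taken from the paper: the update forms are the ones commonly stated for this estimator and are assumed here, the names `phis`, `rewards`, `rhos` and `reg` are hypothetical, and the small ridge term is an added assumption to keep the linear system solvable. It accumulates the system and solves it once rather than maintaining the inverse recursively.

```python
import numpy as np

def off_policy_lstd_lambda(phis, rewards, rhos, gamma, lam, reg=1e-6):
    """Illustrative off-policy LSTD(lambda) estimate from one trajectory.

    phis:    array of shape (n + 1, d), features of the visited states
    rewards: array of shape (n,), reward observed on each transition
    rhos:    array of shape (n,), importance weights (target policy
             probability over behavior policy probability); all ones
             when the data is on-policy
    """
    phis = np.asarray(phis, dtype=float)
    n, d = len(rewards), phis.shape[1]
    A = reg * np.eye(d)   # ridge term (assumption) keeps A invertible
    b = np.zeros(d)
    z = np.zeros(d)       # eligibility trace
    rho_prev = 1.0        # no weight applies before the first step
    for t in range(n):
        # Decay the trace, reweighted by the previous importance weight.
        z = gamma * lam * rho_prev * z + phis[t]
        # Importance-weighted temporal difference of the features.
        delta_phi = phis[t] - gamma * rhos[t] * phis[t + 1]
        A += np.outer(z, delta_phi)
        b += rhos[t] * rewards[t] * z
        rho_prev = rhos[t]
    # Parameter vector theta such that V(s) is approximated by phi(s) . theta
    return np.linalg.solve(A, b)
```

A recursive variant would typically replace the final solve with a rank-one update of the inverse of `A` at every step; that refinement is omitted here to keep the sketch short.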

Domains

Artificial Intelligence [cs.AI]

Main file

ewrl.pdf (453.58 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-00644511

Submitted on: Thursday, November 24, 2011, 15:22:08

Last modified on: Friday, March 24, 2023, 14:52:55

Long-term archiving on: Saturday, February 25, 2012, 02:26:38

Dates and versions

hal-00644511, version 1 (24-11-2011)

Identifiers

HAL Id: hal-00644511, version 1

Cite

Bruno Scherrer, Matthieu Geist. Recursive Least-Squares Learning with Eligibility Traces. European Workshop on Reinforcement Learning (EWRL 11), Sep 2011, Athens, Greece. ⟨hal-00644511⟩

Collections

SUPELEC CNRS INRIA SUP_IMS UNIV-LORRAINE INRIA2 LORIA
