Sparse Temporal Difference Learning using LASSO

Manuel Loth; Manuel Davy; Philippe Preux

Communication Dans Un Congrès Année : 2007

Sparse Temporal Difference Learning using LASSO

(1) , (1, 2) , (1)

1
2

Manuel Loth

Fonction : Auteur
PersonId : 836853

Sequential Learning

Manuel Davy

Fonction : Auteur

Sequential Learning

LAGIS-SI

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Sequential Learning

Résumé

We consider the problem of on-line value function estimation in reinforcement learning. We concentrate on the function approximator to use. To try to break the curse of dimensionality, we focus on non parametric function approximators. We propose to fit the use of kernels into the temporal difference algorithms by using regression via the LASSO. We introduce the equi-gradient descent algorithm (EGD) which is a direct adaptation of the one recently introduced in the LARS algorithm family for solving the LASSO. We advocate our choice of the EGD as a judicious algorithm for these tasks. We present the EGD algorithm in details as well as some experimental results. We insist on the qualities of the EGD for reinforcement learning.

Domaines

Apprentissage [cs.LG]

Fichier principal

lassoTd.pdf (154.15 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Manuel Loth : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00117075

Soumis le : jeudi 30 novembre 2006-13:15:51

Dernière modification le : vendredi 24 mars 2023-14:52:48

Archivage à long terme le : mardi 6 avril 2010-23:39:31

Dates et versions

inria-00117075 , version 1 (30-11-2006)

Identifiants

HAL Id : inria-00117075 , version 1

Citer

Manuel Loth, Manuel Davy, Philippe Preux. Sparse Temporal Difference Learning using LASSO. IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, Apr 2007, Hawaï, USA, United States. ⟨inria-00117075⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS LAGIS-SI INRIA2

177 Consultations

945 Téléchargements

Sparse Temporal Difference Learning using LASSO

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager