Basis Expansion in Natural Actor Critic Methods

Sertan Girgin; Philippe Preux

Communication Dans Un Congrès Année : 2008

Basis Expansion in Natural Actor Critic Methods

(1) , (1, 2, 3)

1
2
3

Sertan Girgin

Fonction : Auteur

Sequential Learning

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Sequential Learning

Groupe de Recherche en Apprentissage Automatique

Laboratoire d'Informatique Fondamentale de Lille

Résumé

In reinforcement learning, the aim of the agent is to find a policy that maximizes its expected return. Policy gradient methods try to accomplish this goal by directly approximating the policy using a parametric function approximator; the expected return of the current policy is estimated and its parameters are updated by steepest ascent in the direction of the gradient of the expected return with respect to the policy parameters. In general, the policy is defined in terms of a set of basis functions that capture important features of the problem. Since the quality of the resulting policies directly depend on the set of basis func- tions, and defining them gets harder as the complexity of the problem increases, it is important to be able to find them automatically. In this paper, we propose a new approach which uses cascade-correlation learn- ing architecture for automatically constructing a set of basis functions within the context of Natural Actor-Critic (NAC) algorithms. Such basis functions allow more complex policies be represented, and consequently improve the performance of the resulting policies. We also present the effectiveness of the method empirically.

Mots clés

reinforcement learning natural gradient policy gradient feature discovery

Domaines

Apprentissage [cs.LG]

Fichier principal

ewrl8.pdf (140.14 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Preux Philippe : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00826055

Soumis le : mardi 4 juin 2013-09:15:27

Dernière modification le : vendredi 24 mars 2023-14:52:57

Archivage à long terme le : jeudi 5 septembre 2013-04:19:25

Dates et versions

hal-00826055 , version 1 (04-06-2013)

Identifiants

HAL Id : hal-00826055 , version 1

Citer

Sertan Girgin, Philippe Preux. Basis Expansion in Natural Actor Critic Methods. European Workshop on Reinforcement Learning, Jun 2008, Villeneuve d'Ascq, France. pp.110-123. ⟨hal-00826055⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LIFL LAGIS INRIA2

203 Consultations

280 Téléchargements

Basis Expansion in Natural Actor Critic Methods

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager