A Model-Based Actor-Critic Algorithm in Continuous Time and Space - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2003

A Model-Based Actor-Critic Algorithm in Continuous Time and Space

Rémi Coulom
  • Fonction : Auteur
  • PersonId : 836044

Résumé

This paper presents a model-based actor-critic algorithm in continuous time and space. Two function approximators are used: one learns the policy (the actor) and the other learns the state-value function (the critic). The critic learns with the TD(lambda) algorithm and the actor by gradient ascent on the Hamiltonian. A similar algorithm had been proposed by Doya, but this one is more general. This algorithm was applied successfully to teach simulated articulated robots to swim.
Fichier principal
Vignette du fichier
A03-R-125.pdf (76.16 Ko) Télécharger le fichier
Loading...

Dates et versions

inria-00107659 , version 1 (19-10-2006)

Identifiants

  • HAL Id : inria-00107659 , version 1

Citer

Rémi Coulom. A Model-Based Actor-Critic Algorithm in Continuous Time and Space. Sixth European Workshop on Reinforcement Learning - EWRL6, Sep 2003, Nancy, France, 2 p. ⟨inria-00107659⟩
129 Consultations
68 Téléchargements

Partager

Gmail Facebook X LinkedIn More