A Model-Based Actor-Critic Algorithm in Continuous Time and Space - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Conference Papers Year : 2003

A Model-Based Actor-Critic Algorithm in Continuous Time and Space

Rémi Coulom
  • Function : Author
  • PersonId : 836044

Abstract

This paper presents a model-based actor-critic algorithm in continuous time and space. Two function approximators are used: one learns the policy (the actor) and the other learns the state-value function (the critic). The critic learns with the TD(lambda) algorithm and the actor by gradient ascent on the Hamiltonian. A similar algorithm had been proposed by Doya, but this one is more general. This algorithm was applied successfully to teach simulated articulated robots to swim.
Fichier principal
Vignette du fichier
A03-R-125.pdf (76.16 Ko) Télécharger le fichier
Loading...

Dates and versions

inria-00107659 , version 1 (19-10-2006)

Identifiers

  • HAL Id : inria-00107659 , version 1

Cite

Rémi Coulom. A Model-Based Actor-Critic Algorithm in Continuous Time and Space. Sixth European Workshop on Reinforcement Learning - EWRL6, Sep 2003, Nancy, France, 2 p. ⟨inria-00107659⟩
129 View
68 Download

Share

Gmail Facebook X LinkedIn More