High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot

Rémi Coulom

Conference Papers Year : 2004

Neuromimetic intelligence

Abstract

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this paper, we present experimental results obtained by using a feedforward neural network instead. The learning algorithm used was model-based continuous TD(lambda). It generated an efficient controller, producing a high-accuracy state-value function. A striking feature of this value function is a very sharp 4-dimensional ridge that is extremely hard to evaluate with linear parametric approximators. From a broader point of view, this experimental success demonstrates some of the qualities of feedforward neural networks in comparison with linear approximators in reinforcement learning.
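
To make the kind of update described in the abstract concrete, here is a minimal sketch of one TD(lambda) step with a small feedforward network as the value function. It is not the paper's method: the paper uses a model-based, continuous-time TD(lambda), whereas this is the textbook discrete-time, model-free variant with accumulating eligibility traces. The network size, parameter layout, step size, discount factor, and all function names (`init_theta`, `value_and_grad`, `td_lambda_step`) are assumptions chosen for illustration.

```python
import math
import random


def init_theta(n_in, n_hidden, seed=0):
    """Flat parameter vector [W (n_hidden*n_in), b (n_hidden), v (n_hidden), c].

    Hypothetical layout for a one-hidden-layer tanh network; not from the paper.
    """
    rng = random.Random(seed)
    weights_in = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden * n_in)]
    biases = [0.0] * n_hidden
    weights_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
    return weights_in + biases + weights_out + [0.0]


def value_and_grad(theta, s, n_hidden):
    """Return V(s) = sum_j v_j * tanh(W_j . s + b_j) + c and dV/dtheta."""
    n_in = len(s)
    n_w = n_hidden * n_in
    W = theta[:n_w]
    b = theta[n_w:n_w + n_hidden]
    v = theta[n_w + n_hidden:n_w + 2 * n_hidden]
    c = theta[n_w + 2 * n_hidden]
    # Hidden activations.
    h = [math.tanh(sum(W[j * n_in + i] * s[i] for i in range(n_in)) + b[j])
         for j in range(n_hidden)]
    value = sum(v[j] * h[j] for j in range(n_hidden)) + c
    # Backpropagated gradient, in the same order as theta.
    grad_W = [v[j] * (1.0 - h[j] ** 2) * s[i]
              for j in range(n_hidden) for i in range(n_in)]
    grad_b = [v[j] * (1.0 - h[j] ** 2) for j in range(n_hidden)]
    grad_v = list(h)
    return value, grad_W + grad_b + grad_v + [1.0]


def td_lambda_step(theta, trace, s, r, s_next, terminal, n_hidden,
                   alpha=0.05, gamma=0.95, lam=0.7):
    """One discrete-time TD(lambda) update (assumed hyperparameters)."""
    v, grad = value_and_grad(theta, s, n_hidden)
    v_next = 0.0 if terminal else value_and_grad(theta, s_next, n_hidden)[0]
    delta = r + gamma * v_next - v                      # TD error
    trace = [gamma * lam * e + g for e, g in zip(trace, grad)]
    theta = [t + alpha * delta * e for t, e in zip(theta, trace)]
    return theta, trace, delta


# Usage: a single update on a made-up 2-dimensional state.
theta = init_theta(n_in=2, n_hidden=4, seed=1)
trace = [0.0] * len(theta)
theta, trace, delta = td_lambda_step(
    theta, trace, s=[0.3, -0.2], r=1.0, s_next=[0.1, 0.0],
    terminal=False, n_hidden=4)
```

With a linear approximator the gradient would simply be a fixed feature vector; here it depends on the current weights through the hidden layer, which is the structural difference between the two families of approximators that the abstract contrasts.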

Keywords

motor control, reinforcement learning, neural networks

Domains

Other [cs.OH]

Main file

A04-R-082.pdf (124.25 KB)

https://inria.hal.science/inria-00107776

Submitted on: Thursday, October 19, 2006, 9:08:54 AM

Last modification on: Thursday, February 15, 2024, 3:30:50 AM

Long-term archiving on: Wednesday, March 29, 2017, 1:04:28 PM

Dates and versions

inria-00107776 , version 1 (19-10-2006)

Identifiers

HAL Id : inria-00107776 , version 1

Cite

Rémi Coulom. High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot. 12th European Symposium on Artificial Neural Networks - ESANN'2004, Michel Verleysen, 2004, Bruges, Belgium, pp.7-12. ⟨inria-00107776⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM
