High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot

Rémi Coulom

Communication Dans Un Congrès Année : 2004

High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot

(1)

Rémi Coulom

Fonction : Auteur
PersonId : 836044

Neuromimetic intelligence

Résumé

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this paper, we present experimental results obtained by using a feedforward neural network instead. The learning algorithm used was model-based continuous TD(lambda). It generated an efficient controller, producing a high-accuracy state-value function. A striking feature of this value function is a very sharp 4-dimensional ridge that is extremely hard to evaluate with linear parametric approximators. From a broader point of view, this experimental success demonstrates some of the qualities of feedforward neural networks in comparison with linear approximators in reinforcement learning.

Mots clés

motor control apprentissage par renforcement reinforcement learning neural networks réseaux de neurones contrôle moteur

Domaines

Autre [cs.OH]

Fichier principal

A04-R-082.pdf (124.25 Ko)

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00107776

Soumis le : jeudi 19 octobre 2006-09:08:54

Dernière modification le : jeudi 15 février 2024-03:30:50

Archivage à long terme le : mercredi 29 mars 2017-13:04:28

Dates et versions

inria-00107776 , version 1 (19-10-2006)

Identifiants

HAL Id : inria-00107776 , version 1

Citer

Rémi Coulom. High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot. 12th European Symposium on Artificial Neural Networks - ESANN'2004, Michel Verleysen, 2004, Bruges, Belgique, pp.7-12. ⟨inria-00107776⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

130 Consultations

337 Téléchargements

High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager