Preprint / Working Paper. Year: 2010

Inverse Kinematics On-line Learning: a Kernel-Based Policy-Gradient approach

Emmanuel Daucé, Alain Dutech

Abstract

In machine learning, "kernel methods" give a consistent framework for applying the perceptron algorithm to non-linear problems. In reinforcement learning, an analog of the perceptron delta-rule can be derived from the "policy-gradient" approach proposed by Williams in 1992 in the framework of stochastic neural networks. Despite its generality and straightforward applicability to continuous command problems, few developments of the method have been proposed since. Here we present an account of the use of a kernel transformation of the perception space for the on-line learning of a motor command, in the cases of eye orientation and multi-joint arm control. We first show that such a setting allows the system to solve non-linear problems, such as the log-like resolution of a foveated retina, or the transformation from a Cartesian perception space to the "angular" command of a multi-joint arm. More interestingly, the on-line recurrent learning we propose is simple and fully operational in changing environments, and allows for constant improvement of the policy on the basis of simple and measurable error terms.
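To make the abstract's idea concrete, here is a minimal, hypothetical sketch of a delta-rule-style policy-gradient update (in the spirit of Williams' REINFORCE) applied over Gaussian kernel features of a non-linear perception space, on a toy 1-D "eye orientation" task with a log-like retina. The task setup, all constants, and the function names are illustrative assumptions, not the paper's actual experiments or code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical task: the target position x in [-1, 1] is perceived through a
# log-like, foveated nonlinearity; the policy must learn the inverse mapping.
def perceive(x, k=5.0):
    """Log-like 'retinal' compression of the target position (illustrative)."""
    return np.sign(x) * np.log1p(k * np.abs(x)) / np.log1p(k)

# Gaussian kernel features over the (non-linear) perception space.
centers = np.linspace(-1.0, 1.0, 15)
width = 0.15
def features(p):
    k = np.exp(-0.5 * ((p - centers) / width) ** 2)
    return k / k.sum()                   # normalized kernel activations

w = np.zeros_like(centers)               # linear policy weights (mean command)
sigma = 0.1                              # std of the Gaussian exploration noise
alpha = 0.05                             # learning rate (assumed value)
baseline = 0.0                           # running-average reward baseline
errors = []

for episode in range(3000):
    x = rng.uniform(-1.0, 1.0)           # draw a target position
    phi = features(perceive(x))
    mean = w @ phi                       # deterministic part of the command
    eps = rng.standard_normal()
    a = mean + sigma * eps               # stochastic motor command
    r = -abs(a - x)                      # reward: negative command error
    baseline = 0.95 * baseline + 0.05 * r
    # Delta-rule-like policy-gradient update: advantage times score function.
    w += alpha * (r - baseline) * (eps / sigma) * phi
    errors.append(abs(a - x))

early = float(np.mean(errors[:300]))     # mean error over the first episodes
late = float(np.mean(errors[-300:]))     # mean error over the last episodes
print(f"mean |error|: first 300 episodes {early:.3f}, last 300 {late:.3f}")
```

The point of the sketch is that the update touches only the kernel activations of the current percept, so learning is local and fully on-line: the residual error after training is dominated by the exploration noise, and the same rule keeps adapting if the perception or the plant changes.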
Main file: draft_nips10.pdf (256.52 KB). Origin: files produced by the author(s).

Dates and versions

inria-00520960 , version 1 (24-09-2010)

Identifiers

  • HAL Id : inria-00520960 , version 1

Cite

Emmanuel Daucé, Alain Dutech. Inverse Kinematics On-line Learning: a Kernel-Based Policy-Gradient approach. 2010. ⟨inria-00520960⟩
116 views
267 downloads
