Online Learning with Noise: A Kernel-Based Policy-Gradient Approach

Emmanuel Daucé; Alain Dutech

Communication Dans Un Congrès Année : 2010

Online Learning with Noise: A Kernel-Based Policy-Gradient Approach

(1) , (2)

1
2

Emmanuel Daucé

Fonction : Auteur
PersonId : 854754

Institut des Sciences du Mouvement Etienne Jules Marey

Alain Dutech

Fonction : Auteur
PersonId : 1580
IdHAL : alain-dutech
ORCID : 0000-0001-7549-7988
IdRef : 131102532

Autonomous intelligent machine

Résumé

Various forms of noise are present in the brain. The role of noise in a exploration/exploitation trade-off is cast into the framework of reinforcement learning for a complex task of motor learning. A neuro-controler using a linear transformation of the input to which is added a gaussian noise is modelized as a stochastic controler that can be learned online in ''direct policy-gradient'' scheme. The reward signal is related to sensor information, thus no direct or indirect model of the system to control is needed. The task chosen (reaching with a multi-joint arm) is redundant and non-linear. The controler inputs are then projected to a feature space of higher dimension using a topographic coding based on gaussian kernels. We show that through a consistent noise level it possible to explore the environnment so as to find good control solution that can be exploited. Besides, the controler is able to adapt continuously to changes in the system dynamics. The general framework of this work will allow to study various noises and their effect, especially since it is quite compatible with more complexe types of stochastic neuro-controler, as demonstrated by other works on binary or spiking networks.

Mots clés

Connectionnist models Plasticity and functional specialization Action selection

Domaines

Intelligence artificielle [cs.AI] Neurosciences [q-bio.NC] Neurosciences

Fichier principal

dauce_noisePolicyGradient_NeuroComp10.pdf (186.9 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alain Dutech : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00517006

Soumis le : lundi 13 septembre 2010-13:31:07

Dernière modification le : vendredi 24 mars 2023-14:52:53

Archivage à long terme le : mardi 23 octobre 2012-16:00:36

Dates et versions

inria-00517006 , version 1 (13-09-2010)

Identifiants

HAL Id : inria-00517006 , version 1

Citer

Emmanuel Daucé, Alain Dutech. Online Learning with Noise: A Kernel-Based Policy-Gradient Approach. Conférence Française de Neurosciences Computationnelles - NeuroComp 2010, Oct 2010, Lyon, France. ⟨inria-00517006⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-AMU NEUROCOMP2010 UNIV-LORRAINE INRIA2 LORIA ISM-EJM ANR

187 Consultations

80 Téléchargements

Online Learning with Noise: A Kernel-Based Policy-Gradient Approach

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager