Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

Rémi Pautrat; Konstantinos Chatzilygeroudis; Jean-Baptiste Mouret

Conference paper, Year: 2018

Rémi Pautrat

Role: Author

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Konstantinos Chatzilygeroudis

Role: Author
PersonId : 10921
IdHAL : konstantinos-chatzilygeroudis
ORCID : 0000-0003-3585-1027
IdRef : 234845414

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Jean-Baptiste Mouret

Role: Author
PersonId : 1495
IdHAL : jb-mouret
ORCID : 0000-0002-2513-027X
IdRef : 137470002

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Abstract

One of the most interesting features of Bayesian optimization for direct policy search is that it can leverage priors (e.g., from simulation or from previous tasks) to accelerate learning on a robot. In this paper, we are interested in situations in which several priors exist but we do not know in advance which one best fits the current situation. We tackle this problem by introducing a novel acquisition function, called Most Likely Expected Improvement (MLEI), that combines the likelihood of the priors and the expected improvement. We evaluate this new acquisition function on a transfer learning task for a 5-DOF planar arm and on a possibly damaged, 6-legged robot that has to learn to walk on flat ground and on stairs, with priors corresponding to different stairs and different kinds of damage. Our results show that MLEI effectively identifies and exploits the priors, even when there is no obvious match between the current situation and the priors.
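To make the idea of combining prior likelihood with expected improvement concrete, here is a minimal sketch. The abstract does not give the formula, so this assumes one plausible reading: each prior is used as the mean function of a Gaussian process, each candidate point is scored by the expected improvement multiplied by the marginal likelihood of the observations under that prior, and the prior with the highest best score is selected. The RBF kernel, its length scale, the noise level, the 1-D toy reward, and all function names (`gp_posterior`, `mlei_select`, and so on) are illustrative assumptions, not taken from the paper.

```python
import math
import numpy as np

def rbf(a, b, length=0.3):
    # Squared-exponential kernel between two 1-D arrays of inputs (assumed kernel).
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length) ** 2)

def gp_posterior(x_obs, y_obs, x_query, prior_mean, noise=1e-3):
    # GP regression on the residuals between the observations and the prior mean.
    K = rbf(x_obs, x_obs) + noise * np.eye(len(x_obs))
    Ks = rbf(x_query, x_obs)
    resid = y_obs - prior_mean(x_obs)
    alpha = np.linalg.solve(K, resid)
    mu = prior_mean(x_query) + Ks @ alpha
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)
    # Log marginal likelihood of the observations under this prior mean.
    _, logdet = np.linalg.slogdet(K)
    log_lik = -0.5 * (resid @ alpha + logdet + len(x_obs) * math.log(2 * math.pi))
    return mu, np.sqrt(np.maximum(var, 1e-12)), log_lik

def expected_improvement(mu, sigma, best):
    # Standard expected improvement for a maximization problem.
    z = (mu - best) / sigma
    cdf = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2)))
    pdf = np.exp(-0.5 * z ** 2) / math.sqrt(2 * math.pi)
    return (mu - best) * cdf + sigma * pdf

def mlei_select(x_obs, y_obs, x_query, priors):
    # Assumed combination: score = EI * likelihood of the data under the prior.
    # Returns (prior name, next point to try, score) for the best-scoring prior.
    best_y = y_obs.max()
    best = None
    for name, prior_mean in priors.items():
        mu, sigma, log_lik = gp_posterior(x_obs, y_obs, x_query, prior_mean)
        score = expected_improvement(mu, sigma, best_y) * math.exp(log_lik)
        i = int(np.argmax(score))
        if best is None or score[i] > best[2]:
            best = (name, float(x_query[i]), float(score[i]))
    return best

# Toy usage: the "good" prior matches the hypothetical reward, the "bad" one does not.
reward = lambda x: -(x - 0.7) ** 2
priors = {
    "good": lambda x: -(x - 0.7) ** 2,
    "bad": lambda x: -20.0 * (x - 0.2) ** 2,
}
x_obs = np.array([0.1, 0.5, 0.9])
y_obs = reward(x_obs)
name, x_next, score = mlei_select(x_obs, y_obs, np.linspace(0.0, 1.0, 101), priors)
print(name, x_next)
```

In this toy setting the mismatched prior explains the three observations poorly, so its likelihood term shrinks its score and the matching prior is chosen. Multiplying raw likelihoods can underflow when there are many observations; a log-space comparison would be a natural refinement.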

Domains

Automatic control / Robotics; Robotics [cs.RO]; Artificial Intelligence [cs.AI]

Main file

1709.06919.pdf (3.4 MB)

Origin: Files produced by the author(s)

Contributor: Jean-Baptiste Mouret

https://inria.hal.science/hal-01768279

Submitted on: Tuesday, April 17, 2018, 10:14:11

Last modified on: Thursday, February 1, 2024, 10:05:32

Dates and versions

hal-01768279 , version 1 (17-04-2018)

Identifiers

HAL Id : hal-01768279 , version 1
ARXIV : 1709.06919

Cite

Rémi Pautrat, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret. Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search. ICRA 2018 - International Conference on Robotics and Automation, May 2018, Brisbane, Australia. ⟨hal-01768279⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 TDS-MACS LORIA LORIA-AIS UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM CREATIV-LAB

181 views

299 downloads
