Interactive Robot Education

Riad Akrour; Marc Schoenauer; Michèle Sebag

Conference paper, Year: 2013



Riad Akrour

Role: Author
PersonId : 910562

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Marc Schoenauer

Role: Author
PersonId : 739309
IdHAL : evomarc
ORCID : 0000-0003-1450-6830
IdRef : 057775575

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Michèle Sebag

Role: Author
PersonId : 836537

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Abstract

Aimed at on-board robot training, an approach hybridizing active preference learning and reinforcement learning is presented: Interactive Bayesian Policy Search (IBPS) builds a robotic controller through direct and frugal interaction with a human expert, who iteratively expresses preferences among a few behaviors demonstrated by the robot. These preferences allow the robot to gradually refine its estimate of the policy utility and to select a new policy to demonstrate according to an Expected Utility of Selection criterion. The paper's contribution is the handling of preference noise, which arises from the expert's mistakes or from disinterest when the demonstrated behaviors are equally unsatisfactory. A noise model is proposed that enables a resource-limited robot to soundly estimate the preference noise and maintain a robust interaction with the expert, thereby keeping the sample complexity low. A proof of principle of the IBPS approach, in simulation and on-board, is presented.
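To make the idea of learning from noisy preferences concrete, the following is a minimal sketch and not the paper's implementation. It assumes a linear utility over hypothetical behavior features, a small set of candidate utility weight vectors standing in for the Bayesian posterior, and a simple ridge-shaped noise model with a hypothetical threshold `delta`: the expert is taken to be reliable when the utility gap exceeds `delta` and increasingly random as the gap shrinks. The Expected Utility of Selection criterion itself is not implemented here; all function and parameter names are illustrative.

```python
def utility(w, phi):
    """Linear utility of a behavior with feature vector phi (assumed form)."""
    return sum(wi * fi for wi, fi in zip(w, phi))


def preference_likelihood(w, phi_a, phi_b, delta):
    """P(expert prefers a over b | w) under an assumed ridge noise model.

    The expert is certain when the utility gap exceeds delta, and
    answers closer to a coin flip as the gap approaches zero.
    """
    gap = utility(w, phi_a) - utility(w, phi_b)
    if gap >= delta:
        return 1.0
    if gap <= -delta:
        return 0.0
    return 0.5 + gap / (2.0 * delta)


def update_posterior(candidates, probs, phi_a, phi_b, delta):
    """Reweight candidate utilities after the expert states 'a is better than b'."""
    unnorm = [p * preference_likelihood(w, phi_a, phi_b, delta)
              for w, p in zip(candidates, probs)]
    total = sum(unnorm)
    if total == 0.0:
        # The answer contradicts every candidate; keep the prior unchanged.
        return list(probs)
    return [u / total for u in unnorm]


def expected_utility(candidates, probs, phi):
    """Posterior-mean utility of a behavior."""
    return sum(p * utility(w, phi) for w, p in zip(candidates, probs))


# Two candidate utilities, each valuing a different feature.
candidates = [(1.0, 0.0), (0.0, 1.0)]
prior = [0.5, 0.5]
behavior_a, behavior_b = (1.0, 0.0), (0.0, 1.0)

# The expert prefers behavior_a; with delta = 2 the answer is treated as noisy.
posterior = update_posterior(candidates, prior, behavior_a, behavior_b, delta=2.0)
print(posterior)  # the first candidate now carries weight 0.75
```

With a larger `delta`, a single answer shifts the posterior less, which is one way a learner can stay robust when the expert's preference is unreliable.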

Subject areas

Artificial Intelligence [cs.AI]

Main file

PBRL_01-Akrour.pdf (289.6 KB)

Origin: Files produced by the author(s)

Contributor: Marc Schoenauer

https://inria.hal.science/hal-00931347

Submitted on: Wednesday, January 15, 2014, 11:34:08

Last modified on: Monday, February 12, 2024, 09:48:04

Long-term archiving on: Tuesday, April 15, 2014, 22:16:48

Dates and versions

hal-00931347, version 1 (15-01-2014)

Identifiers

HAL Id: hal-00931347, version 1

Cite

Riad Akrour, Marc Schoenauer, Michèle Sebag. Interactive Robot Education. ECML/PKDD Workshop on Reinforcement Learning with Generalized Feedback: Beyond Numeric Rewards, Sep 2013, Berlin, Germany. ⟨hal-00931347⟩


Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

451 views

297 downloads
