Algorithmic and Human Teaching of Sequential Decision Tasks

Maya Cakmak; Manuel Lopes

Conference paper, Year: 2012

Maya Cakmak

Role: Author

Socially Intelligent Machines Lab

Manuel Lopes

Role: Author
PersonId: 1873
IdHAL: manuel-lopes
ORCID: 0000-0002-6238-8974
IdRef: 188282947

Flowing Epigenetic Robots and Systems

Abstract

A helpful teacher can significantly improve the learning rate of a learning agent. Teaching algorithms have been formally studied within the field of Algorithmic Teaching. These studies give important insights into how a teacher can select the most informative examples while teaching a new concept. However, the field has so far focused purely on classification tasks. In this paper we introduce a novel method for optimally teaching sequential decision tasks. We present an algorithm that automatically selects the set of most informative demonstrations and evaluate it on several navigation tasks. Next, we explore the idea of using this algorithm to produce instructions for humans on how to choose examples when teaching sequential decision tasks. We present a user study that demonstrates the utility of such instructions.

Domains

Machine Learning [cs.LG]

Main file

aaai_teaching_final.pdf (781.2 KB)

Origin: Files produced by the author(s)

Contributor: Manuel Lopes

https://inria.hal.science/hal-00755253

Submitted on: Tuesday, November 20, 2012, 17:39:25

Last modified on: Wednesday, March 15, 2023, 08:50:07

Long-term archiving on: Thursday, February 21, 2013, 12:30:57

Dates and versions

hal-00755253, version 1 (20-11-2012)

Identifiers

HAL Id: hal-00755253, version 1

Cite

Maya Cakmak, Manuel Lopes. Algorithmic and Human Teaching of Sequential Decision Tasks. AAAI Conference on Artificial Intelligence (AAAI-12), Jul 2012, Toronto, Canada. ⟨hal-00755253⟩

Collections

ENSTA INRIA PARISTECH ENSTA_U2IS INRIA2

306 views

373 downloads
