Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

Cédric Colas; Tristan Karch; Olivier Sigaud; Pierre-Yves Oudeyer

Preprint, Working Paper. Year: 2021

Cédric Colas

Role: Author
PersonId: 742663
IdHAL: cedric-colas
ORCID: 0000-0003-0212-427X

Flowing Epigenetic Robots and Systems

Tristan Karch

Role: Author
PersonId: 744349
IdHAL: tristan-karch

Flowing Epigenetic Robots and Systems

Olivier Sigaud

Role: Author
PersonId: 14932
IdHAL: olivier-sigaud
ORCID: 0000-0002-8544-0229
IdRef: 072724714

Institut des Systèmes Intelligents et de Robotique

Pierre-Yves Oudeyer

Role: Author
PersonId: 6675
IdHAL: pyoudeyer
ORCID: 0000-0002-9404-7613
IdRef: 081674481

Flowing Epigenetic Robots and Systems

Abstract

Building autonomous machines that can explore open-ended environments, discover possible interactions and autonomously build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be achieved by autonomous and intrinsically motivated learning agents that can generate, select and learn to solve their own problems. In recent years, we have seen a convergence of developmental approaches, and developmental robotics in particular, with deep reinforcement learning (RL) methods, forming the new domain of developmental machine learning. Within this new domain, we review here a set of methods where deep RL algorithms are trained to tackle the developmental robotics problem of the autonomous acquisition of open-ended repertoires of skills. Intrinsically motivated goal-conditioned RL algorithms train agents to learn to represent, generate and pursue their own goals. The self-generation of goals requires the learning of compact goal encodings as well as their associated goal-achievement functions, which results in new challenges compared to traditional RL algorithms designed to tackle pre-defined sets of goals using external reward signals. This paper proposes a typology of these methods at the intersection of deep RL and developmental approaches, surveys recent approaches and discusses future avenues.

Domains

Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]

Main file

2012.09830(1).pdf (1.39 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-03099891

Submitted on: Wednesday, January 6, 2021, 13:13:44

Last modified on: Saturday, October 7, 2023, 21:36:23

Long-term archiving on: Wednesday, April 7, 2021, 20:07:27

Dates and versions

hal-03099891, version 1 (06-01-2021)

Identifiers

HAL Id: hal-03099891, version 1

Cite

Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer. Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey. 2021. ⟨hal-03099891⟩

Collections

ENSTA CNRS INRIA ISIR INRIA2 SORBONNE-UNIVERSITE SU-SCIENCES ISIR_AMAC

331 Views

840 Downloads
