Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey - Archive ouverte HAL Access content directly
Preprints, Working Papers, ... Year :

Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

(1) , (1) , (2) , (1)
1
2
Cédric Colas
Tristan Karch
Pierre-Yves Oudeyer

Abstract

Building autonomous machines that can explore open-ended environments, discover possible interactions and autonomously build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be achieved by autonomous and intrinsically motivated learning agents that can generate, select and learn to solve their own problems. In recent years, we have seen a convergence of developmental approaches, and developmental robotics in particular, with deep reinforcement learning (RL) methods, forming the new domain of developmental machine learning. Within this new domain, we review here a set of methods where deep RL algorithms are trained to tackle the developmental robotics problem of the autonomous acquisition of open-ended repertoires of skills. Intrinsically motivated goal-conditioned RL algorithms train agents to learn to represent, generate and pursue their own goals. The self-generation of goals requires the learning of compact goal encodings as well as their associated goal-achievement functions, which results in new challenges compared to traditional RL algorithms designed to tackle pre-defined sets of goals using external reward signals. This paper proposes a typology of these methods at the intersection of deep RL and developmental approaches, surveys recent approaches and discusses future avenues.
Fichier principal
Vignette du fichier
2012.09830(1).pdf (1.39 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03099891 , version 1 (06-01-2021)

Identifiers

  • HAL Id : hal-03099891 , version 1

Cite

Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer. Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey. 2021. ⟨hal-03099891⟩
163 View
578 Download

Share

Gmail Facebook Twitter LinkedIn More