Skip to Main content Skip to Navigation
Journal articles

Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy

Abstract : In open-ended continuous environments, robots need to learn multiple parameterised control tasks in hierarchical reinforcement learning. We hypothesise that the most complex tasks can be learned more easily by transferring knowledge from simpler tasks, and faster by adapting the complexity of the actions to the task. We propose a task-oriented representation of complex actions, called procedures, to learn online task relationships and unbounded sequences of action primitives to control the different observables of the environment. Combining both goal-babbling with imitation learning, and active learning with transfer of knowledge based on intrinsic motivation, our algorithm self-organises its learning process. It chooses at any given time a task to focus on; and what, how, when and from whom to transfer knowledge. We show with a simulation and a real industrial robot arm, in cross-task and cross-learner transfer settings, that task composition is key to tackle highly complex tasks. Task decomposition is also efficiently transferred across different embodied learners and by active imitation, where the robot requests just a small amount of demonstrations and the adequate type of information. The robot learns and exploits task dependencies so as to learn tasks of every complexity.
Complete list of metadata
Contributor : Sao Mai Nguyen Connect in order to contact the contributor
Submitted on : Thursday, February 18, 2021 - 5:51:26 PM
Last modification on : Friday, January 21, 2022 - 3:21:01 AM
Long-term archiving on: : Wednesday, May 19, 2021 - 6:01:34 PM


Files produced by the author(s)



Nicolas Duminy, Sao Mai Nguyen, Junshuai Zhu, Dominique Duhaut, Jerome Kerdreux. Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy. Applied Sciences, MDPI, 2021, 11 (3), pp.975. ⟨10.3390/app11030975⟩. ⟨hal-03118190⟩



Les métriques sont temporairement indisponibles