Conference Paper, Year: 2019

MERL: Multi-Head Reinforcement Learning

Abstract

A common challenge in reinforcement learning is how to efficiently sample an environment so that the agent's interactions translate into fast and robust learning, leading to high performance on complex tasks. For instance, earlier work makes use of domain/prior knowledge to improve existing reinforcement learning algorithms. While promising, previously acquired knowledge is often costly and difficult to scale up. Instead, we consider the use of problem knowledge: signals from any relevant quantity useful for solving many tasks, e.g., self-performance assessment and accurate expectations. We propose MERL, a general framework for structuring reinforcement learning by injecting problem knowledge into policy gradient updates. Unlike other auxiliary-task methods, MERL is generally applicable to any task. As a result, policy and value functions are no longer optimized solely for a reward but are learned using task-agnostic quantities. In this paper: (a) we introduce and define MERL, our new multi-head reinforcement learning framework; (b) we conduct experiments across a variety of standard benchmark environments, including 9 continuous control tasks, where results show improved performance; (c) we demonstrate that MERL also improves transfer learning on a set of challenging tasks; (d) we investigate how our approach tackles reward sparsity and better conditions the feature space of deep reinforcement learning agents.
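
To make the idea concrete, below is a minimal sketch (not the authors' released code) of a multi-head actor-critic in PyTorch: a shared torso feeds a policy head, a value head, and an auxiliary head that regresses a task-agnostic quantity, and the auxiliary loss is added to the policy-gradient update. The class and function names, the choice of auxiliary target, and the loss coefficients are illustrative assumptions, not the paper's exact formulation.

# Illustrative sketch of a multi-head policy-gradient update (assumptions noted above).
import torch
import torch.nn as nn


class MultiHeadActorCritic(nn.Module):
    """Shared torso with policy, value, and auxiliary heads (illustrative)."""

    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.torso = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
        )
        self.policy_head = nn.Linear(hidden, n_actions)  # action logits
        self.value_head = nn.Linear(hidden, 1)           # state value V(s)
        self.aux_head = nn.Linear(hidden, 1)             # task-agnostic quantity

    def forward(self, obs):
        h = self.torso(obs)
        return (self.policy_head(h),
                self.value_head(h).squeeze(-1),
                self.aux_head(h).squeeze(-1))


def merl_style_loss(model, obs, actions, advantages, returns, aux_targets,
                    v_coef=0.5, aux_coef=0.1, ent_coef=0.01):
    """Policy-gradient loss augmented with value- and auxiliary-head terms."""
    logits, values, aux_pred = model(obs)
    dist = torch.distributions.Categorical(logits=logits)
    pg_loss = -(dist.log_prob(actions) * advantages).mean()  # actor term
    v_loss = (returns - values).pow(2).mean()                 # critic regression
    aux_loss = (aux_targets - aux_pred).pow(2).mean()         # auxiliary regression
    entropy = dist.entropy().mean()                           # exploration bonus
    return pg_loss + v_coef * v_loss + aux_coef * aux_loss - ent_coef * entropy

In practice, aux_targets would be derived from the rollout itself, for example a self-performance signal measuring how well the value head explains the observed returns; this sketch simply takes them as given.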
Main file
main.pdf (1.43 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02305105 , version 1 (03-10-2019)
hal-02305105 , version 2 (13-10-2019)
hal-02305105 , version 3 (29-11-2019)

Identifiers

  • HAL Id: hal-02305105, version 1

Cite

Yannis Flet-Berliac, Philippe Preux. MERL: Multi-Head Reinforcement Learning. Deep Reinforcement Learning Workshop (NeurIPS 2019), Dec 2019, Vancouver, Canada. ⟨hal-02305105v1⟩
279 Views
1090 Downloads
