Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

Mathieu Seurin; Florian Strub; Philippe Preux; Olivier Pietquin

Communication Dans Un Congrès Année : 2021

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

(1, 2, 3, 4) , (5) , (1, 2, 3, 4) , (6)

1
2
3
4
5
6

Mathieu Seurin

Fonction : Auteur
PersonId : 1039295

Scool

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Université de Lille

Centrale Lille

Florian Strub

Fonction : Auteur

DeepMind [London]

Philippe Preux

Fonction : Auteur

Scool

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Université de Lille

Centrale Lille

Olivier Pietquin

Fonction : Auteur

Google Research [Paris]

Résumé

Sparse rewards are double-edged training signals in reinforcement learning: easy to design but hard to optimize. Intrinsic motivation guidances have thus been developed toward alleviating the resulting exploration problem. They usually incentivize agents to look for new states through novelty signals. Yet, such methods encourage exhaustive exploration of the state space rather than focusing on the environment's salient interaction opportunities. We propose a new exploration method, called Don't Do What Doesn't Matter (DoWhaM), shifting the emphasis from state novelty to state with relevant actions. While most actions consistently change the state when used, e.g. moving the agent, some actions are only effective in specific states, e.g., opening a door, grabbing an object. DoWhaM detects and rewards actions that seldom affect the environment. We evaluate DoWhaM on the procedurallygenerated environment MiniGrid, against state-ofthe-art methods. Experiments consistently show that DoWhaM greatly reduces sample complexity, installing the new state-of-the-art in MiniGrid.

Domaines

Apprentissage [cs.LG]

Fichier principal

Rare_Actions_Matter_IJCAI.pdf (2.97 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Mathieu Seurin : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03259315

Soumis le : lundi 14 juin 2021-09:19:15

Dernière modification le : mercredi 24 janvier 2024-09:54:22

Archivage à long terme le : jeudi 16 septembre 2021-08:21:05

Dates et versions

hal-03259315 , version 1 (14-06-2021)

Identifiants

HAL Id : hal-03259315 , version 1

Citer

Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin. Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness. Internationnal Joint Conference on Artificial Intelligence (IJCAI), Aug 2021, Montreal, Canada. pp.2950--2956. ⟨hal-03259315⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA GRID5000 CRISTAL INRIA2 UNIV-LILLE SILECS CRISTAL-SCOOL

79 Consultations

93 Téléchargements

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager