Learning Exploration Strategies in Model-Based Reinforcement Learning

Todd Hester; Peter Stone; Manuel Lopes

Conference paper, Year: 2013


Todd Hester

Role: Author

Department of Computer Science [Austin]

Peter Stone

Role: Author

Department of Computer Science [Austin]

Manuel Lopes

Role: Author
PersonId : 1873
IdHAL : manuel-lopes
ORCID : 0000-0002-6238-8974
IdRef : 188282947

Flowing Epigenetic Robots and Systems

Abstract

Reinforcement learning (RL) is a paradigm for learning sequential decision-making tasks. However, the user must typically hand-tune exploration parameters for each domain and/or algorithm in use. In this work, we present an algorithm called LEO for learning these exploration strategies on-line. The algorithm uses bandit-type algorithms to adaptively select exploration strategies based on the rewards received when following them. We show empirically that this method performs well across a set of five domains. In contrast, for a given algorithm, no single set of parameters is best across all domains. Our results demonstrate that the LEO algorithm successfully learns the best exploration strategies on-line, increasing the received reward over static parameterizations of exploration and reducing the need for hand-tuning exploration parameters.
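
To make the idea of bandit-based strategy selection concrete, here is a minimal sketch. It is not the authors' algorithm: it swaps in the standard UCB1 bandit rule, and the three strategy names, their success probabilities, and the stand-in for running an episode are all hypothetical, chosen only for illustration. It assumes each episode yields a return scaled to [0, 1].

```python
import math
import random


class StrategySelector:
    """UCB1 bandit over a fixed set of exploration strategies (illustrative only)."""

    def __init__(self, strategy_names):
        self.names = list(strategy_names)
        self.counts = [0] * len(self.names)   # episodes run with each strategy
        self.means = [0.0] * len(self.names)  # running mean return per strategy
        self.total = 0                        # total episodes so far

    def select(self):
        # Try every strategy once before applying the UCB1 rule.
        for i, c in enumerate(self.counts):
            if c == 0:
                return i
        # Mean return plus a confidence bonus that shrinks with more trials.
        scores = [
            m + math.sqrt(2.0 * math.log(self.total) / c)
            for m, c in zip(self.means, self.counts)
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, arm, episode_return):
        # Incremental update of the mean return for the chosen strategy.
        self.counts[arm] += 1
        self.total += 1
        self.means[arm] += (episode_return - self.means[arm]) / self.counts[arm]


def run_demo(num_episodes=5000, seed=0):
    rng = random.Random(seed)
    # Hypothetical expected returns; a real agent would not know these.
    true_mean = {"greedy": 0.2, "epsilon_greedy": 0.5, "optimism_bonus": 0.8}
    selector = StrategySelector(true_mean)
    for _ in range(num_episodes):
        arm = selector.select()
        name = selector.names[arm]
        # Stand-in for running one episode of an RL agent with this strategy.
        episode_return = 1.0 if rng.random() < true_mean[name] else 0.0
        selector.update(arm, episode_return)
    return selector


if __name__ == "__main__":
    sel = run_demo()
    for name, count, mean in zip(sel.names, sel.counts, sel.means):
        print(f"{name}: chosen {count} times, mean return {mean:.3f}")
```

In this sketch the selector spends most episodes on whichever strategy has paid off best so far, while still occasionally retrying the others. That is the general behavior the abstract describes: exploration strategies are chosen adaptively from observed reward rather than fixed by hand.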

Domains

Machine Learning [cs.LG]

Contributor: Manuel Lopes

https://inria.hal.science/hal-00871861

Submitted on: Thursday, October 10, 2013, 17:38:29

Last modified on: Wednesday, March 15, 2023, 08:50:07

Dates and versions

hal-00871861, version 1 (10-10-2013)

Identifiers

HAL Id: hal-00871861, version 1

Cite

Todd Hester, Peter Stone, Manuel Lopes. Learning Exploration Strategies in Model-Based Reinforcement Learning. AAMAS 2013 - 12th International Conference on Autonomous Agents and Multiagent Systems, May 2013, St. Paul, MN, United States. pp.1069-1076. ⟨hal-00871861⟩


Collections

ENSTA INRIA PARISTECH ENSTA_U2IS INRIA2

164 views

0 downloads
