Near-Optimal BRL using Optimistic Local Transitions

Mauricio Araya-López; Vincent Thomas; Olivier Buffet

Communication Dans Un Congrès Année : 2012

Near-Optimal BRL using Optimistic Local Transitions

(1) , (1) , (1)

Mauricio Araya-López

Fonction : Auteur
PersonId : 881106

Autonomous intelligent machine

Vincent Thomas

Fonction : Auteur
PersonId : 16368
IdHAL : vincent-thomas
ORCID : 0000-0003-3401-4649

Autonomous intelligent machine

Olivier Buffet

Fonction : Auteur
PersonId : 1407
IdHAL : olivier-buffet
ORCID : 0000-0002-5072-5857

Autonomous intelligent machine

Résumé

Model-based Bayesian Reinforcement Learning (BRL) allows a sound formalization of the problem of acting optimally while facing an unknown environment, i.e., avoiding the exploration-exploitation dilemma. However, algorithms explicitly addressing BRL suffer from such a combinatorial explosion that a large body of work relies on heuristic algorithms. This paper introduces bolt, a simple and (almost) deterministic heuristic algorithm for BRL which is optimistic about the transition function. We analyze bolt's sample complexity, and show that under certain parameters, the algorithm is near-optimal in the Bayesian sense with high probability. Then, experimental results highlight the key differences of this method compared to previous work.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

icml12.pdf (270.19 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Olivier Buffet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00755270

Soumis le : mardi 20 novembre 2012-18:44:42

Dernière modification le : jeudi 1 février 2024-10:05:16

Archivage à long terme le : jeudi 21 février 2013-12:32:01

Dates et versions

hal-00755270 , version 1 (20-11-2012)

Identifiants

HAL Id : hal-00755270 , version 1

Citer

Mauricio Araya-López, Vincent Thomas, Olivier Buffet. Near-Optimal BRL using Optimistic Local Transitions. International Conference on Machine Learning - ICML 2012, Jun 2012, Edimburgh, United Kingdom. ⟨hal-00755270⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LORIA-AIS UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

230 Consultations

90 Téléchargements

Near-Optimal BRL using Optimistic Local Transitions

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager