Bayesian Reinforcement Learning

Nikos Vlassis; Mohammad Ghavamzadeh; Shie Mannor; Pascal Poupart

Book chapter, Year: 2012

Nikos Vlassis

Role: Author

Intelligent Systems Lab.

Mohammad Ghavamzadeh

Role: Author
PersonId: 868946

Sequential Learning

Shie Mannor

Role: Author

Department of Electrical Engineering - Technion [Haïfa]

Pascal Poupart

Role: Author

School of Computer Science [Waterloo]

Abstract

This chapter surveys recent lines of work that use Bayesian techniques for reinforcement learning. In Bayesian learning, uncertainty is expressed by a prior distribution over unknown parameters and learning is achieved by computing a posterior distribution based on the data observed. Hence, Bayesian reinforcement learning distinguishes itself from other forms of reinforcement learning by explicitly maintaining a distribution over various quantities such as the parameters of the model, the value function, the policy or its gradient. This yields several benefits: a) domain knowledge can be naturally encoded in the prior distribution to speed up learning; b) the exploration/exploitation tradeoff can be naturally optimized; and c) notions of risk can be naturally taken into account to obtain robust policies.
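The prior-to-posterior update described above can be illustrated with a minimal sketch. It is not taken from the chapter: it assumes a Bernoulli bandit with a conjugate Beta prior on each arm's success probability and uses posterior sampling (Thompson sampling) to trade off exploration and exploitation. The function names `update`, `thompson_step`, and `run`, and the uniform Beta(1, 1) prior, are hypothetical choices made for this illustration.

```python
import random


def update(alpha, beta, reward):
    """Conjugate Beta-Bernoulli update: return the posterior (alpha, beta)
    after observing one binary reward (1 = success, 0 = failure)."""
    return alpha + reward, beta + (1 - reward)


def thompson_step(posteriors, rng):
    """Draw one success probability per arm from its Beta posterior and
    return the index of the arm with the largest draw."""
    samples = [rng.betavariate(a, b) for a, b in posteriors]
    return max(range(len(samples)), key=samples.__getitem__)


def run(true_probs, steps=2000, seed=0):
    """Simulate posterior sampling on a Bernoulli bandit.

    true_probs holds the (unknown to the learner) success probability
    of each arm; it is an assumption of this toy simulation.
    """
    rng = random.Random(seed)
    # Assumed prior: uniform Beta(1, 1) on every arm.
    posteriors = [(1.0, 1.0) for _ in true_probs]
    pulls = [0] * len(true_probs)
    for _ in range(steps):
        arm = thompson_step(posteriors, rng)
        reward = 1 if rng.random() < true_probs[arm] else 0
        a, b = posteriors[arm]
        posteriors[arm] = update(a, b, reward)
        pulls[arm] += 1
    return posteriors, pulls
```

Calling `run([0.2, 0.8])` returns the final Beta parameters and the pull count of each arm. Replacing the Beta(1, 1) prior with a more informative one is how domain knowledge would enter this sketch.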

Domains

Computer Science

Main file

BRLchapter.pdf (162.56 KB)

Origin: Files produced by the author(s)

Contributor: Mohammad Ghavamzadeh

https://inria.hal.science/hal-00840479

Submitted on: Tuesday, 2 July 2013, 15:31:33

Last modified on: Tuesday, 31 October 2023, 13:44:06

Long-term archiving on: Thursday, 3 October 2013, 10:45:23

Dates and versions

hal-00840479, version 1 (02-07-2013)

Identifiers

HAL Id: hal-00840479, version 1

Cite

Nikos Vlassis, Mohammad Ghavamzadeh, Shie Mannor, Pascal Poupart. Bayesian Reinforcement Learning. Marco Wiering and Martijn van Otterlo. Reinforcement Learning: State of the Art, Springer Verlag, 2012. ⟨hal-00840479⟩

Collections

UNIV-LILLE3 CNRS INRIA LAGIS INRIA2
