Bayesian Multi-Task Reinforcement Learning

Alessandro Lazaric 1, * Mohammad Ghavamzadeh 1
* Auteur correspondant
1 SEQUEL - Sequential Learning
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe, LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal
Abstract : We consider the problem of multi-task reinforcement learning where the learner is provided with a set of tasks, for which only a small number of samples can be generated for any given policy. As the number of samples may not be enough to learn an accurate evaluation of the policy, it would be necessary to identify classes of tasks with similar structure and to learn them jointly. We consider the case where the tasks share structure in their value functions, and model this by assuming that the value functions are all sampled from a common prior. We adopt the Gaussian process temporal-difference value function model and use a hierarchical Bayesian approach to model the distribution over the value functions. We study two cases, where all the value functions belong to the same class and where they belong to an undefined number of classes. For each case, we present a hierarchical Bayesian model, and derive inference algorithms for (i) joint learning of the value functions, and (ii) efficient transfer of the information gained in (i) to assist learning the value function of a newly observed task.
Type de document :
Communication dans un congrès
ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. Omnipress, pp.599-606, 2010
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00475214
Contributeur : Mohammad Ghavamzadeh <>
Soumis le : mercredi 21 avril 2010 - 14:43:12
Dernière modification le : jeudi 11 janvier 2018 - 06:22:13
Document(s) archivé(s) le : mardi 28 septembre 2010 - 13:08:32

Fichier

bmtl.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00475214, version 1

Collections

Citation

Alessandro Lazaric, Mohammad Ghavamzadeh. Bayesian Multi-Task Reinforcement Learning. ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. Omnipress, pp.599-606, 2010. 〈inria-00475214〉

Partager

Métriques

Consultations de la notice

755

Téléchargements de fichiers

548