Bellmanian Bandit Network

Antoine Bureau; Michèle Sebag

Communication Dans Un Congrès Année : 2014

Bellmanian Bandit Network

(1, 2) , (1, 3, 2)

1
2
3

Antoine Bureau

Fonction : Auteur
PersonId : 963035

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Michèle Sebag

Fonction : Auteur
PersonId : 836537

Laboratoire de Recherche en Informatique

Centre National de la Recherche Scientifique

Machine Learning and Optimisation

Résumé

This paper presents a new reinforcement learning (RL) algorithm called Bellmanian Bandit Network (BBN), where action selection in each state is formalized as a multi-armed bandit problem. The first contribution lies in the definition of an exploratory reward inspired from the intrinsic motivation criterion [1], combined with the RL reward. The second contribution is to use a network of multi-armed bandits to achieve the convergence toward the optimal Q-value function. The BBN algorithm is validated in stationary and non-stationary grid-world environments, comparatively to [1].

Mots clés

Reinforcement Learning Model-based approach Intrinsic Motivation

Domaines

Informatique [cs] Apprentissage [cs.LG]

Fichier principal

nips14_BBN.pdf (489.7 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Bureau : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01102970

Soumis le : mardi 13 janvier 2015-17:18:23

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : samedi 15 avril 2017-17:22:40

Dates et versions

hal-01102970 , version 1 (13-01-2015)

Identifiants

HAL Id : hal-01102970 , version 1

Citer

Antoine Bureau, Michèle Sebag. Bellmanian Bandit Network. Autonomously Learning Robots, at NIPS 2014, Gerhard Neumann (TU-Darmstadt); Joelle Pineau (McGill University); Peter Auer (Uni Leoben); Marc Toussaint (Uni Stuttgart), Dec 2014, Montréal, Canada. ⟨hal-01102970⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

410 Consultations

190 Téléchargements

Bellmanian Bandit Network

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager