Error Reducing Sampling in Reinforcement Learning - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Error Reducing Sampling in Reinforcement Learning

Résumé

In reinforcement learning, an agent collects information interacting with an environment and uses it to derive a behavior. This paper focuses on efficient sampling; that is, the problem of choosing the interaction samples so that the corresponding behavior tends quickly to the optimal behavior. Our main result is a sensitivity analysis relating the choice of sampling any state-action pair to the decrease of an error bound on the optimal solution. We derive two new model-based algorithms. Simulations demonstrate a quicker convergence (in the sense of the number of samples) of the value function to the real optimal value function.
Fichier principal
Vignette du fichier
scherrer.pdf (133.95 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00337659 , version 1 (07-11-2008)

Identifiants

  • HAL Id : inria-00337659 , version 1

Citer

Bruno Scherrer, Shie Mannor. Error Reducing Sampling in Reinforcement Learning. NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning, Dec 2008, Whistler, Canada. ⟨inria-00337659⟩
89 Consultations
99 Téléchargements

Partager

Gmail Facebook X LinkedIn More