Error reducing sampling in reinforcement learning

Bruno Scherrer; Shie Mannor

Rapport (Rapport De Recherche) Année : 2004

Error reducing sampling in reinforcement learning

(1) , (2)

1
2

Bruno Scherrer

Fonction : Auteur
PersonId : 1406
IdHAL : bruno-scherrer
IdRef : 073360708

Autonomous intelligent machine

Shie Mannor

Fonction : Auteur

Laboratory for Information and Decision Systems - Massachusetts Institute of Technology

Résumé

In reinforcement learning, an agent collects information interacting with an environment and uses it to derive a behavior. This paper focuses on efficient sampling; that is, the problem of choosing the interaction samples so that the corresponding behavior tends quickly to the optimal behavior. Our main result is a sensitivity analysis relating the choice of sampling any state-action pair to the decrease of an error bound on the optimal solution. We derive two new model-based algorithms. Simulations demonstrate a quicker convergence (in the sense of the number of samples) of the value function to the real optimal value function.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

papier.pdf (201.35 Ko)

Bruno Scherrer : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00098352

Soumis le : lundi 25 septembre 2006-16:13:18

Dernière modification le : jeudi 15 février 2024-03:31:36

Archivage à long terme le : mardi 6 avril 2010-01:10:20

Dates et versions

inria-00098352 , version 1 (25-09-2006)

Identifiants

HAL Id : inria-00098352 , version 1

Citer

Bruno Scherrer, Shie Mannor. Error reducing sampling in reinforcement learning. [Research Report] 2004, pp.15. ⟨inria-00098352⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

67 Consultations

65 Téléchargements

Error reducing sampling in reinforcement learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager