Error reducing sampling in reinforcement learning

Bruno Scherrer; Shie Mannor

Reports (Research Report) Year : 2004

Bruno Scherrer

Autonomous intelligent machine

Shie Mannor

Laboratory for Information and Decision Systems - Massachusetts Institute of Technology

Abstract

In reinforcement learning, an agent collects information by interacting with an environment and uses it to derive a behavior. This paper focuses on efficient sampling, that is, the problem of choosing the interaction samples so that the resulting behavior converges quickly to the optimal behavior. Our main result is a sensitivity analysis relating the choice of which state-action pair to sample to the decrease of an error bound on the optimal solution. We derive two new model-based algorithms. Simulations show faster convergence, measured in number of samples, of the value function to the true optimal value function.
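
The model-based setting described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the two-state MDP, the known rewards, the uniform fallback for unvisited pairs, and the "least-visited pair" sampling rule are all assumptions made for illustration. The paper's algorithms would replace that placeholder rule with one driven by the sensitivity of the error bound.

```python
import random

def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """P[s][a] lists next-state probabilities; R[s][a] is the reward."""
    n = len(P)
    V = [0.0] * n
    while True:
        V_new = [
            max(R[s][a] + gamma * sum(p * V[t] for t, p in enumerate(P[s][a]))
                for a in range(len(P[s])))
            for s in range(n)
        ]
        if max(abs(x - y) for x, y in zip(V, V_new)) < tol:
            return V_new
        V = V_new

def empirical_model(counts):
    """Turn visit counts[s][a][t] into estimated transition probabilities."""
    n = len(counts)
    P_hat = []
    for s in range(n):
        row = []
        for a in range(len(counts[s])):
            total = sum(counts[s][a])
            # Unvisited pairs fall back to a uniform guess (an assumption).
            row.append([c / total for c in counts[s][a]] if total else [1.0 / n] * n)
        P_hat.append(row)
    return P_hat

def sample_and_estimate(P_true, R, budget, seed=0):
    """Spend `budget` samples, then plan on the estimated model."""
    rng = random.Random(seed)
    n, m = len(P_true), len(P_true[0])
    counts = [[[0] * n for _ in range(m)] for _ in range(n)]
    for _ in range(budget):
        # Placeholder sampling rule (hypothetical): query the least-visited pair.
        s, a = min(((s, a) for s in range(n) for a in range(m)),
                   key=lambda sa: sum(counts[sa[0]][sa[1]]))
        t = rng.choices(range(n), weights=P_true[s][a])[0]
        counts[s][a][t] += 1
    return value_iteration(empirical_model(counts), R)

# Hypothetical two-state, two-action MDP; rewards are assumed known.
P_true = [[[0.8, 0.2], [0.1, 0.9]],
          [[0.5, 0.5], [0.0, 1.0]]]
R = [[0.0, 0.0], [0.0, 1.0]]

V_true = value_iteration(P_true, R)
V_hat = sample_and_estimate(P_true, R, budget=400)
print(max(abs(x - y) for x, y in zip(V_true, V_hat)))
```

The printed quantity is the sup-norm gap between the value function computed from the sampled model and the true optimal value function, which is the kind of error the abstract measures against the number of samples.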

Domains

Artificial Intelligence [cs.AI]

https://inria.hal.science/inria-00098352

Submitted on: Monday, September 25, 2006, 4:13:18 PM

Last modification on: Thursday, February 15, 2024, 3:31:36 AM

Long-term archiving on: Tuesday, April 6, 2010, 1:10:20 AM

Dates and versions

inria-00098352 , version 1 (25-09-2006)

Identifiers

HAL Id : inria-00098352 , version 1

Cite

Bruno Scherrer, Shie Mannor. Error reducing sampling in reinforcement learning. [Research Report] 2004, pp.15. ⟨inria-00098352⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM
