Performance bound for Approximate Optimistic Policy Iteration

Bruno Scherrer; Christophe Thiery

Report (Technical Report) Year: 2010

Bruno Scherrer

Role: Author
PersonId: 1406
IdHAL: bruno-scherrer
IdRef: 073360708

Autonomous intelligent machine

Christophe Thiery

Role: Author
PersonId: 842769

Autonomous intelligent machine

Abstract

We provide a proof of the performance bound theorem published in "Least-Squares λ Policy Iteration: Bias-Variance Trade-off in Control Problems" (ICML 2010).

Domains

Artificial Intelligence [cs.AI]

Main file

opi_proof.pdf (60.41 KB)

Origin: Files produced by the author(s)

Contributor: Christophe Thiery

https://inria.hal.science/inria-00480952

Submitted on: Wednesday, May 5, 2010, 14:56:27

Last modified on: Friday, March 24, 2023, 14:52:53

Long-term archiving on: Thursday, September 16, 2010, 13:36:05

Dates and versions

inria-00480952, version 1 (05-05-2010)

Identifiers

HAL Id: inria-00480952, version 1

Cite

Bruno Scherrer, Christophe Thiery. Performance bound for Approximate Optimistic Policy Iteration. [Technical Report] 2010. ⟨inria-00480952⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LARA

198 views

329 downloads
