
Optimism in Reinforcement Learning and Kullback-Leibler Divergence (Optimisme en apprentissage par renforcement et divergence de Kullback-Leibler)

Abstract : We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies.
Document type :
Documents associated with scientific events

https://hal.inria.fr/inria-00510327
Contributor : Conférence Mas2010
Submitted on : Wednesday, August 18, 2010 - 9:45:08 AM
Last modification on : Friday, November 6, 2020 - 11:36:03 PM
Long-term archiving on : Friday, November 19, 2010 - 2:31:53 AM

File

REN-Filippi.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00510327, version 1

Citation

Sarah Filippi, Olivier Cappé, Aurélien Garivier. Optimisme en apprentissage par renforcement et divergence de Kullback-Leibler. Journées MAS et Journée en l'honneur de Jacques Neveu, Aug 2010, Talence, France. ⟨inria-00510327⟩
