Conference Papers
Year : 2012
Daniil Ryabko : Connect in order to contact the contributor
https://hal.inria.fr/hal-00765441
Submitted on : Friday, December 14, 2012-3:59:32 PM
Last modification on : Friday, March 24, 2023-2:52:56 PM
Dates and versions
Identifiers
- HAL Id : hal-00765441 , version 1
Cite
Ronald Ortner, Daniil Ryabko. Online Regret Bounds for Undiscounted Continuous Reinforcement Learning. NIPS 2012, 2012, Lake Tahoe, United States. pp.1772--1780. ⟨hal-00765441⟩
Collections
6799
View
0
Download