Odalric-Ambrym Maillard, Phuong Nguyen, Ronald Ortner, Daniil Ryabko. Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning.
ICML - 30th International Conference on Machine Learning, 2013, Atlanta, USA, United States. pp.543-551.
⟨hal-00778586⟩