On the Sample Complexity of Reinforcement Learning, 2003. ,
Approximately optimal approximate reinforcement learning, ICML, pp.267-274, 2002. ,
Least-squares policy iteration, Journal of Machine Learning Research, vol.4, pp.1107-1149, 2003. ,
Reinforcement Learning as Classification : Leveraging Modern Classifiers, Proceedings of ICML, pp.424-431, 2003. ,
Analysis of a Classification-based Policy Iteration Algorithm, Proceedings of ICML, pp.607-614, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00482065
Error Bounds for Approximate Policy Iteration, International Conference on Machine Learning (ICML), pp.560-567, 2003. ,
Performance Bounds in Lp norm for Approximate Value Iteration, SIAM J. Control and Optimization, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00124685
Markov Decision Processes, 1994. ,
DOI : 10.1002/9780470316887
Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris, Journal of Machine Learning Research, vol.14, pp.1175-1221, 2013. ,
URL : https://hal.archives-ouvertes.fr/inria-00185271
On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes, NIPS 2012 -Neural Information Processing Systems, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00758809
Approximate Modified Policy Iteration, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00758882