T. Archibald, K. Mckinnon, T. , and L. , On the Generation of Markov Decision Processes, Journal of the Operational Research Society, vol.46, issue.3, pp.354-361, 1995.
DOI : 10.1057/jors.1995.50

J. A. Bagnell, S. M. Kakade, A. Ng, and J. Schneider, Policy search by dynamic programming, NIPS, 2003.

D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, 1996.
DOI : 10.1007/0-306-48332-7_333

A. M. Farahmand, R. Munos, and C. Szepesvári, Error propagation for approximate policy and value iteration (extended version), NIPS, 2010.

M. Ghavamzadeh and A. Lazaric, Conservative and Greedy Approaches to Classification-based Policy Iteration, AAAI, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00772610

S. Kakade and J. Langford, Approximately optimal approximate reinforcement learning, ICML, 2002.

M. Lagoudakis and R. Parr, Reinforcement Learning as Classification: Leveraging Modern Classifiers, ICML, 2003.

M. G. Lagoudakis and R. Parr, Least-squares policy iteration, Journal of Machine Learning Research (JMLR), vol.4, pp.1107-1149, 2003.

A. Lazaric, M. Ghavamzadeh, M. , and R. , Analysis of a Classification-based Policy Iteration Algorithm, ICML, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00482065

R. Munos, Error Bounds for Approximate Policy Iteration, ICML, 2003.

R. Munos, Performance Bounds in Lp norm for Approximate Value Iteration, SIAM J. Control and Optimization, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00124685

B. Scherrer, Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris, Journal of Machine Learning Research, vol.14, pp.1175-1221, 2013.
URL : https://hal.archives-ouvertes.fr/inria-00185271

B. Scherrer and B. Lesner, On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes, NIPS, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00758809

. Scherrer, . Bruno, . Ghavamzadeh, . Mohammad, . Gabillon et al., Approximate Modified Policy Iteration, ICML, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00758882