D. P. Bertsekas and J. N. Tsitsiklis, Neuro-dynamic programming, athena scientific, 1996.

A. G. Barto, S. J. Bradtke, and S. P. Singh, Learning to act using real-time dynamic programming, Artificial Intelligence, vol.72, issue.1-2, 1993.
DOI : 10.1016/0004-3702(94)00011-O

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

R. Coulom, High-accuracy value-function approximation with neural networks, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00107776

R. Coulom, Reinforcement Learning Using Neural Networks, with Applications to Motor Control, 2002.
URL : https://hal.archives-ouvertes.fr/tel-00003985

R. Munos and A. Moore, Variable resolution discretization in optimal control, 1999.

R. S. Sutton, Generalization in reinforcement learning: Successful examples using sparse coarse coding, Advances in Neural Information Processing Systems, pp.1038-1044

D. Precup and R. S. Sutton, Exponentiated gradient methods for reinforcement learning, Proc. 14th International Conference on Machine Learning, pp.272-277, 1997.

J. Yoshimoto, S. Ishii, and M. Sato, Application of reinforcement learning to balancing of Acrobot, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028), 1999.
DOI : 10.1109/ICSMC.1999.815605

C. W. Anderson, Q-learning with hidden-unit restarting, Advances in Neural Information Processing Systems 5, pp.81-88, 1993.

R. Kretchmar and C. Anderson, Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning, Proceedings of International Conference on Neural Networks (ICNN'97), 1997.
DOI : 10.1109/ICNN.1997.616132

B. Ratitch and D. Precup, Sparse Distributed Memories for On-Line Value-Based Reinforcement Learning, pp.347-358, 2004.
DOI : 10.1007/978-3-540-30115-8_33

S. Keerthi and B. Ravindran, A tutorial survey of reinforcement learning, 1995.

R. Munos, Error bounds for approximate value iteration, Proceedings of AAAI 2005, 2005.
DOI : 10.1137/040614384

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.466.4205

I. H. Witten and E. Frank, Data mining, ACM SIGMOD Record, vol.31, issue.1, 2005.
DOI : 10.1145/507338.507355

R. Kohavi, The power of decision tables, Proceedings of the European Conference on Machine Learning, pp.174-189, 1995.
DOI : 10.1007/3-540-59286-5_57

G. John, L. E. Cleary, and . Trigg, K*: an instance-based learner using an entropic distance measure, Proc. 12th International Conference on Machine Learning, pp.108-114, 1995.

P. Rousseeuw and A. Leroy, Robust Regression and Outlier Detection, 1987.
DOI : 10.1002/0471725382

J. Otero and L. Sanchez, Induction of descriptive fuzzy classifiers with the logitboost algorithm. soft computing, 2005.

A. J. Smola and B. Scholkopf, A tutorial on support vector regression, Statistics and Computing, vol.14, issue.3, 1998.
DOI : 10.1023/B:STCO.0000035301.49549.88

S. K. Shevade, S. S. Keerthi, C. Bhattacharyya, and K. R. Murthy, Improvements to the SMO algorithm for SVM regression, IEEE Transactions on Neural Networks, vol.11, issue.5, 1999.
DOI : 10.1109/72.870050

H. Niederreiter, Random Number Generation and Quasi-Monte Carlo Methods, 1992.
DOI : 10.1137/1.9781611970081