T. Baeck, Evolutionary Algorithms in Theory and Practice, 1995.

A. Barto, S. Bradtke, and S. Singh, Learning to act using real-time dynamic programming, Artificial Intelligence, vol.72, issue.1-2, 1993.
DOI : 10.1016/0004-3702(94)00011-O

D. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, 1996.

C. Cervellera and M. Muselli, A Deterministic Learning Approach Based on Discrepancy, Proceedings of WIRN'03, pp.53-60, 2003.
DOI : 10.1007/978-3-540-45216-4_5

L. Chapel and G. Deffuant, SVM viability controller active learning, Kernel Machines for Reinforcement Learning Workshop, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00616861

D. A. Cohn, Z. Ghahramani, and M. I. Jordan, Active learning with statistical models, Advances in Neural Information Processing Systems, pp.705-712, 1995.

R. Collobert and S. Bengio, SVMTorch: Support vector machines for large-scale regression problems, Journal of Machine Learning Research, vol.1, pp.143-160, 2001.

L. Devroye, L. Gyorfi, A. Krzyzak, and G. Lugosi, On the Strong Universal Consistency of Nearest Neighbor Regression Function Estimates, The Annals of Statistics, vol.22, issue.3, 1994.
DOI : 10.1214/aos/1176325633

A. Eiben and J. Smith, Introduction to Evolutionary Computing, 2003.

S. Gelly, J. Mary, and O. Teytaud, Learning for dynamic programming, Proceedings of ESANN, 2006.

S. Gelly and O. Teytaud, OpenDP, a C++ framework for stochastic dynamic programming and reinforcement learning, 2005.

M. Kearns, Y. Mansour, and A. Ng, A sparse sampling algorithm for near-optimal planning in large Markov decision processes, IJCAI, pp.1324-1331, 1999.

P. Larranaga and J. A. Lozano, Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation, 2001.

S. LaValle and M. Branicky, On the relationship between classical grid search and probabilistic roadmaps, Proc. Workshop on the Algorithmic Foundations of Robotics, 2002.

P. L'Ecuyer and C. Lemieux, Recent advances in randomized quasi-Monte Carlo methods, 2002.

D. Lewis and W. Gale, Training text classifiers by uncertainty sampling, Proceedings of International ACM Conference on Research and Development in Information Retrieval, pp.3-12, 1994.

F. Liang and W. Wong, Real-Parameter Evolutionary Monte Carlo With Applications to Bayesian Mixture Models, Journal of the American Statistical Association, vol.96, issue.454, pp.653-666, 2001.
DOI : 10.1198/016214501753168325

S. R. Lindemann and S. M. LaValle, Incremental low-discrepancy lattice methods for motion planning, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422), pp.2920-2927, 2003.
DOI : 10.1109/ROBOT.2003.1242039

R. Munos and A. W. Moore, Variable resolution discretization for high-accuracy solutions of optimal control problems, IJCAI, pp.1348-1355, 1999.

H. Niederreiter, Random Number Generation and Quasi-Monte Carlo Methods, 1992.
DOI : 10.1137/1.9781611970081

A. Owen, Quasi-Monte Carlo Sampling, A Chapter, 2003.

J. Rust, Using Randomization to Break the Curse of Dimensionality, Econometrica, vol.65, issue.3, pp.487-516, 1997.
DOI : 10.2307/2171751

G. Schohn and D. Cohn, Less is more: Active learning with support vector machines, Proceedings of the 17th International Conference on Machine Learning, pp.839-846, 2000.

H. S. Seung, M. Opper, and H. Sompolinsky, Query by committee, Proceedings of the fifth annual workshop on Computational learning theory, COLT '92, pp.287-294, 1992.
DOI : 10.1145/130385.130417

I. Sloan and H. Woźniakowski, When Are Quasi-Monte Carlo Algorithms Efficient for High Dimensional Integrals?, Journal of Complexity, vol.14, issue.1, pp.1-33, 1998.
DOI : 10.1006/jcom.1997.0463

R. Sutton and A. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

S. B. Thrun, Efficient exploration in reinforcement learning, 1992.

B. Tuffin, On the use of low discrepancy sequences in Monte Carlo methods, Monte Carlo Methods and Applications, vol.2, issue.4, 1996.
DOI : 10.1515/mcma.1996.2.4.295

M. Vidyasagar, A Theory of Learning and Generalization, with Applications to Neural Networks and Control Systems, 1997.