R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

D. Bertsekas and S. Ioffe, Temporal differences-based policy iteration and applications in neuro-dynamic programming, 1996.

M. Riedmiller, J. Peters, and S. Schaal, Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp.254-261, 2007.
DOI : 10.1109/ADPRL.2007.368196

K. Krawiec, Genetic programming-based construction of features for machine learning and knowledge discovery tasks, Genetic Programming and Evolvable Machines, vol.3, issue.4, pp.329-343, 2002.
DOI : 10.1023/A:1020984725014

M. G. Smith and L. Bull, Genetic Programming with a Genetic Algorithm for Feature Construction and Selection, Genetic Programming and Evolvable Machines, vol.2, issue.4, pp.265-281, 2005.
DOI : 10.1007/s10710-005-2988-7

S. Sanner, Online feature discovery in relational reinforcement learning, Open Problems in Statistical Relational Learning Workshop (SRL-06, 2006.

W. Siedlecki and J. Sklansky, A note on genetic algorithms for large-scale feature selection, Pattern Recognition Letters, vol.10, issue.5, pp.335-347, 1989.
DOI : 10.1016/0167-8655(89)90037-8

M. J. Martin-bautista and M. A. Vila, A survey of genetic feature selection in mining issues, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), 1321.
DOI : 10.1109/CEC.1999.782599

F. Hussein, Genetic algorithms for feature selection and weighting, a review and study, Proceedings of Sixth International Conference on Document Analysis and Recognition, p.1240, 2001.
DOI : 10.1109/ICDAR.2001.953980

P. Nordin, A compiling genetic programming system that directly manipulates the machine code Advances in Genetic Programming, pp.311-331, 1994.

W. Banzhaf, F. D. Francone, R. E. Keller, and P. Nordin, Genetic programming: an introduction: on the automatic evolution of computer programs and its applications, 1998.

A. Fukunaga, A. Stechert, and D. Mutz, A genome compiler for high performance genetic programming, Proceedings of the Third Annual Conference, pp.86-94, 1998.

M. W. Spong, Swing up control of the Acrobot, Proceedings of the 1994 IEEE International Conference on Robotics and Automation, pp.2356-2361, 1994.
DOI : 10.1109/ROBOT.1994.350934

R. Coulom, Reinforcement Learning Using Neural Networks, with Applications to Motor Control, 2002.
URL : https://hal.archives-ouvertes.fr/tel-00003985

D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, 1996.

B. Scherrer, Performance bounds for lambda policy iteration, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00185271

R. S. Sutton, D. A. Mcallester, S. P. Singh, and Y. Mansour, Policy gradient methods for RL with function approximation, In: NIPS, pp.1057-1063, 1999.

J. R. Koza, Genetic programming II: automatic discovery of reusable programs, 1994.