Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998. ,
DOI : 10.1109/TNN.1998.712192
Temporal differences-based policy iteration and applications in neuro-dynamic programming, 1996. ,
Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp.254-261, 2007. ,
DOI : 10.1109/ADPRL.2007.368196
Genetic programming-based construction of features for machine learning and knowledge discovery tasks, Genetic Programming and Evolvable Machines, vol.3, issue.4, pp.329-343, 2002. ,
DOI : 10.1023/A:1020984725014
Genetic Programming with a Genetic Algorithm for Feature Construction and Selection, Genetic Programming and Evolvable Machines, vol.2, issue.4, pp.265-281, 2005. ,
DOI : 10.1007/s10710-005-2988-7
Online feature discovery in relational reinforcement learning, Open Problems in Statistical Relational Learning Workshop (SRL-06, 2006. ,
A note on genetic algorithms for large-scale feature selection, Pattern Recognition Letters, vol.10, issue.5, pp.335-347, 1989. ,
DOI : 10.1016/0167-8655(89)90037-8
A survey of genetic feature selection in mining issues, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), 1321. ,
DOI : 10.1109/CEC.1999.782599
Genetic algorithms for feature selection and weighting, a review and study, Proceedings of Sixth International Conference on Document Analysis and Recognition, p.1240, 2001. ,
DOI : 10.1109/ICDAR.2001.953980
A compiling genetic programming system that directly manipulates the machine code Advances in Genetic Programming, pp.311-331, 1994. ,
Genetic programming: an introduction: on the automatic evolution of computer programs and its applications, 1998. ,
A genome compiler for high performance genetic programming, Proceedings of the Third Annual Conference, pp.86-94, 1998. ,
Swing up control of the Acrobot, Proceedings of the 1994 IEEE International Conference on Robotics and Automation, pp.2356-2361, 1994. ,
DOI : 10.1109/ROBOT.1994.350934
Reinforcement Learning Using Neural Networks, with Applications to Motor Control, 2002. ,
URL : https://hal.archives-ouvertes.fr/tel-00003985
Neuro-Dynamic Programming, Athena Scientific, 1996. ,
Performance bounds for lambda policy iteration, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00185271
Policy gradient methods for RL with function approximation, In: NIPS, pp.1057-1063, 1999. ,
Genetic programming II: automatic discovery of reusable programs, 1994. ,