Neurodynamic Programming, 1996. ,
A tutorial on the cross-entropy method, Annals of Operations Research, vol.1, issue.134, pp.19-67, 2004. ,
Cross-Entropy for Monte-Carlo Tree Search, J. van den ICGA Journal, vol.31, issue.3, pp.145-157, 2008. ,
Tetris is Hard, Even to Approximate, Proc. 9th International Computing and Combinatorics Conference, pp.351-363, 2003. ,
DOI : 10.1007/3-540-45071-8_36
Tetris AI, Computer plays Tetris, 2003. ,
Tetris: A Study of Randomized Constraint Sampling, 2006. ,
DOI : 10.1007/1-84628-095-8_6
Feature Discovery in Reinforcement Learning Using Genetic Programming, 2007. ,
DOI : 10.1007/978-3-540-78671-9_19
URL : https://hal.archives-ouvertes.fr/hal-00826056
Completely Derandomized Self-Adaptation in Evolution Strategies, Evolutionary Computation, vol.9, issue.2, pp.159-195, 2001. ,
DOI : 10.1016/0004-3702(95)00124-7
A natural policy gradient, Advances in Neural Information Processing Systems (NIPS 14), pp.1531-1538, 2001. ,
Least-Squares Methods in Reinforcement Learning for Control, SETN '02: Proceedings of the Second Hellenic Conference on AI, pp.249-260, 2002. ,
DOI : 10.1007/3-540-46014-4_23
Xtris readme, 2005. ,
On the numeric stability of gaussian processes regression for relational reinforcement learning, ICML-2004 Workshop on Relational Reinforcement Learning, pp.10-14, 2004. ,
Learning Tetris Using the Noisy Cross-Entropy Method, Neural Computation, vol.18, issue.12, pp.2936-2941, 2006. ,
DOI : 10.1007/s10479-005-5732-z
Building Controllers for Tetris, ICGA Journal, vol.32, issue.1, pp.3-11, 2009. ,
DOI : 10.3233/ICG-2009-32102
URL : https://hal.archives-ouvertes.fr/inria-00418954
Feature-Based Methods for Large Scale Dynamic Programming, Machine Learning, vol.22, pp.59-94, 1996. ,