R. Bertsekas, D. Tsitsiklis, and J. , Neurodynamic Programming, 1996.

P. Boer, . De, D. Kroese, S. Mannor, and R. Rubinstein, A tutorial on the cross-entropy method, Annals of Operations Research, vol.1, issue.134, pp.19-67, 2004.

G. Chaslot, M. Winands, I. Szita, and H. Herik, Cross-Entropy for Monte-Carlo Tree Search, J. van den ICGA Journal, vol.31, issue.3, pp.145-157, 2008.

E. D. Demaine, S. Hohenberger, and D. Liben-nowell, Tetris is Hard, Even to Approximate, Proc. 9th International Computing and Combinatorics Conference, pp.351-363, 2003.
DOI : 10.1007/3-540-45071-8_36

C. P. Fahey, Tetris AI, Computer plays Tetris, 2003.

V. Farias, B. Roy, and . Van, Tetris: A Study of Randomized Constraint Sampling, 2006.
DOI : 10.1007/1-84628-095-8_6

S. Girgin and P. Preux, Feature Discovery in Reinforcement Learning Using Genetic Programming, 2007.
DOI : 10.1007/978-3-540-78671-9_19

URL : https://hal.archives-ouvertes.fr/hal-00826056

N. Hansen and A. Ostermeier, Completely Derandomized Self-Adaptation in Evolution Strategies, Evolutionary Computation, vol.9, issue.2, pp.159-195, 2001.
DOI : 10.1016/0004-3702(95)00124-7

S. Kakade, A natural policy gradient, Advances in Neural Information Processing Systems (NIPS 14), pp.1531-1538, 2001.

M. G. Lagoudakis, R. Parr, and M. L. Littman, Least-Squares Methods in Reinforcement Learning for Control, SETN '02: Proceedings of the Second Hellenic Conference on AI, pp.249-260, 2002.
DOI : 10.1007/3-540-46014-4_23

R. E. Llima, Xtris readme, 2005.

J. Ramon and K. Driessens, On the numeric stability of gaussian processes regression for relational reinforcement learning, ICML-2004 Workshop on Relational Reinforcement Learning, pp.10-14, 2004.

I. Szita, L. Orincz, and A. , Learning Tetris Using the Noisy Cross-Entropy Method, Neural Computation, vol.18, issue.12, pp.2936-2941, 2006.
DOI : 10.1007/s10479-005-5732-z

C. Thiery and B. Scherrer, Building Controllers for Tetris, ICGA Journal, vol.32, issue.1, pp.3-11, 2009.
DOI : 10.3233/ICG-2009-32102

URL : https://hal.archives-ouvertes.fr/inria-00418954

J. N. Tsitsiklis, B. Roy, and . Van, Feature-Based Methods for Large Scale Dynamic Programming, Machine Learning, vol.22, pp.59-94, 1996.