Improvements on Learning Tetris with Cross Entropy

Christophe Thiery 1 Bruno Scherrer 1
1 MAIA - Autonomous intelligent machine
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : For playing the game of Tetris well, training a controller by the cross-entropy method seems to be a viable way (Szita and Lȍrincz, 2006; Thiery and Scherrer, 2009). We consider this method to tune an evaluation-based one-piece controller as suggested by Szita and Lȍrincz and we introduce some improvements. In this context, we discuss the influence of the noise, and we perform experiments with several sets of features such as those introduced by Bertsekas and Tsitsiklis (1996), by Dellacherie (Fahey, 2003), and some original features. This approach leads to a controller that outperforms the previous known results. On the original game of Tetris, we show that with probability 0.95 it achieves at least 910, 000 ± 5% lines per game on average. On a simplified version of Tetris considered by most research works, it achieves 35, 000, 000 ± 20% lines per game on average. We used this approach when we took part with the program BCTS in the 2008 Tetris domain Reinforcement Learning Competition and won the competition.
Document type :
Journal articles
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.inria.fr/inria-00418930
Contributor : Christophe Thiery <>
Submitted on : Tuesday, September 22, 2009 - 11:55:33 AM
Last modification on : Thursday, January 11, 2018 - 6:19:50 AM
Long-term archiving on : Thursday, June 30, 2011 - 11:49:10 AM

File

article.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00418930, version 1

Collections

Citation

Christophe Thiery, Bruno Scherrer. Improvements on Learning Tetris with Cross Entropy. International Computer Games Association Journal, ICGA, 2009, 32. ⟨inria-00418930⟩

Share

Metrics

Record views

759

Files downloads

1945