Improvements on Learning Tetris with Cross Entropy - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue International Computer Games Association Journal Année : 2009

Improvements on Learning Tetris with Cross Entropy

Christophe Thiery
  • Fonction : Auteur
  • PersonId : 842769
Bruno Scherrer

Résumé

For playing the game of Tetris well, training a controller by the cross-entropy method seems to be a viable way (Szita and Lȍrincz, 2006; Thiery and Scherrer, 2009). We consider this method to tune an evaluation-based one-piece controller as suggested by Szita and Lȍrincz and we introduce some improvements. In this context, we discuss the influence of the noise, and we perform experiments with several sets of features such as those introduced by Bertsekas and Tsitsiklis (1996), by Dellacherie (Fahey, 2003), and some original features. This approach leads to a controller that outperforms the previous known results. On the original game of Tetris, we show that with probability 0.95 it achieves at least 910, 000 ± 5% lines per game on average. On a simplified version of Tetris considered by most research works, it achieves 35, 000, 000 ± 20% lines per game on average. We used this approach when we took part with the program BCTS in the 2008 Tetris domain Reinforcement Learning Competition and won the competition.
Fichier principal
Vignette du fichier
article.pdf (145.94 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00418930 , version 1 (22-09-2009)

Identifiants

  • HAL Id : inria-00418930 , version 1

Citer

Christophe Thiery, Bruno Scherrer. Improvements on Learning Tetris with Cross Entropy. International Computer Games Association Journal, 2009, 32. ⟨inria-00418930⟩
548 Consultations
2851 Téléchargements

Partager

Gmail Facebook X LinkedIn More