Skip to Main content Skip to Navigation
Reports

Modification of UCT with Patterns in Monte-Carlo Go

Sylvain Gelly 1 Yizao Wang 1, 2 Rémi Munos 2, 3 Olivier Teytaud 1
1 TANC - Algorithmic number theory for cryptology
Inria Saclay - Ile de France, LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau]
3 SEQUEL - Sequential Learning
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe, LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal
Abstract : Algorithm UCB1 for multi-armed bandit problem has already been extended to Algorithm UCT (Upper bound Confidence for Tree) which works for minimax tree search. We have developed a Monte-Carlo Go program, MoGo, which is the first computer Go program using UCT. We explain our modification of UCT for Go application and also the intelligent random simulation with patterns which has improved significantly the performance of MoGo. UCT combined with pruning techniques for large Go board is discussed, as well as parallelization of UCT. MoGo is now a top level Go program on $9\times9$ and $13\times13$ Go boards.
Complete list of metadata

https://hal.inria.fr/inria-00117266
Contributor : Rapport de Recherche Inria <>
Submitted on : Tuesday, December 12, 2006 - 10:35:11 AM
Last modification on : Tuesday, November 24, 2020 - 2:18:20 PM
Long-term archiving on: : Friday, November 25, 2016 - 1:58:27 PM

File

RR-6062.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00117266, version 2

Collections

Citation

Sylvain Gelly, Yizao Wang, Rémi Munos, Olivier Teytaud. Modification of UCT with Patterns in Monte-Carlo Go. [Research Report] RR-6062, 2006. ⟨inria-00117266v2⟩

Share

Metrics

Record views

67

Files downloads

115