Adding expert knowledge and exploration in Monte-Carlo Tree Search

Guillaume Chaslot; Christophe Fiter; Jean-Baptiste Hoock; Arpad Rimmel; Olivier Teytaud

Conference Papers Year : 2009

Guillaume Chaslot

Function : Author

Maastricht University [Maastricht]

Christophe Fiter

Function : Author
PersonId : 17255
IdHAL : christophe-fiter
ORCID : 0000-0002-7360-7415
IdRef : 166685917

Machine Learning and Optimisation

Jean-Baptiste Hoock

Function : Author

Machine Learning and Optimisation

Arpad Rimmel

Function : Author
PersonId : 18807
IdHAL : arpad-rimmel
IdRef : 140527273

Machine Learning and Optimisation

Olivier Teytaud

Function : Author
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Machine Learning and Optimisation

Algorithmic number theory for cryptology

Laboratoire de Recherche en Informatique

Abstract

We present a new exploration term, more efficient than classical UCT-like exploration terms and combining efficiently expert rules, patterns extracted from datasets, All-Moves-As-First values and classical online values. As this improved bandit formula does not solve several important situations (semeais, nakade) in computer Go, we present three other important improvements which are central in the recent progress of our program MoGo:

- We show an expert-based improvement of Monte-Carlo simulations for nakade situations; we also emphasize some limitations of this modification.
- We show a technique which preserves diversity in the Monte-Carlo simulation, which greatly improves the results in 19x19.
- Whereas the UCB-based exploration term is not efficient in MoGo, we show a new exploration term which is highly efficient in MoGo.

MoGo recently won a game with handicap 7 against a 9Dan Pro player, Zhou JunXun, winner of the LG Cup 2007, and a game with handicap 6 against a 1Dan pro player, Li-Chen Chien.

Domains

Optimization and Control [math.OC]

Main file

peacg.pdf (388.17 KB)

Origin : Files produced by the author(s)

https://inria.hal.science/inria-00386477

Submitted on : Thursday, May 21, 2009, 10:29:55 PM

Last modification on : Monday, February 12, 2024, 9:48:04 AM

Long-term archiving on : Monday, October 15, 2012, 10:51:28 AM

Dates and versions

inria-00386477 , version 1 (21-05-2009)

Identifiers

HAL Id : inria-00386477 , version 1

Cite

Guillaume Chaslot, Christophe Fiter, Jean-Baptiste Hoock, Arpad Rimmel, Olivier Teytaud. Adding expert knowledge and exploration in Monte-Carlo Tree Search. Advances in Computer Games, 2009, Pamplona, Spain. ⟨inria-00386477⟩

Collections

X EC-PARIS CNRS INRIA LIX INSMI X-LIX X-DEP-INFO UMR8623 INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY
