Adding expert knowledge and exploration in Monte-Carlo Tree Search

Guillaume Chaslot; Christophe Fiter; Jean-Baptiste Hoock; Arpad Rimmel; Olivier Teytaud

Communication Dans Un Congrès Année : 2009

Adding expert knowledge and exploration in Monte-Carlo Tree Search

(1) , (2) , (2) , (2) , (2, 3, 4)

1
2
3
4

Guillaume Chaslot

Fonction : Auteur

Maastricht University [Maastricht]

Christophe Fiter

Fonction : Auteur
PersonId : 17255
IdHAL : christophe-fiter
ORCID : 0000-0002-7360-7415
IdRef : 166685917

Machine Learning and Optimisation

Jean-Baptiste Hoock

Fonction : Auteur

Machine Learning and Optimisation

Arpad Rimmel

Fonction : Auteur
PersonId : 18807
IdHAL : arpad-rimmel
IdRef : 140527273

Machine Learning and Optimisation

Olivier Teytaud

Fonction : Auteur
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Machine Learning and Optimisation

Algorithmic number theory for cryptology

Laboratoire de Recherche en Informatique

Résumé

We present a new exploration term, more efficient than clas- sical UCT-like exploration terms and combining efficiently expert rules, patterns extracted from datasets, All-Moves-As-First values and classi- cal online values. As this improved bandit formula does not solve several important situations (semeais, nakade) in computer Go, we present three other important improvements which are central in the recent progress of our program MoGo: { We show an expert-based improvement of Monte-Carlo simulations for nakade situations; we also emphasize some limitations of this modification. { We show a technique which preserves diversity in the Monte-Carlo simulation, which greatly improves the results in 19x19. { Whereas the UCB-based exploration term is not efficient in MoGo, we show a new exploration term which is highly efficient in MoGo. MoGo recently won a game with handicap 7 against a 9Dan Pro player, Zhou JunXun, winner of the LG Cup 2007, and a game with handicap 6 against a 1Dan pro player, Li-Chen Chien.

Domaines

Optimisation et contrôle [math.OC]

Fichier principal

peacg.pdf (388.17 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Olivier Teytaud : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00386477

Soumis le : jeudi 21 mai 2009-22:29:55

Dernière modification le : vendredi 19 avril 2024-14:42:32

Archivage à long terme le : lundi 15 octobre 2012-10:51:28

Dates et versions

inria-00386477 , version 1 (21-05-2009)

Identifiants

HAL Id : inria-00386477 , version 1

Citer

Guillaume Chaslot, Christophe Fiter, Jean-Baptiste Hoock, Arpad Rimmel, Olivier Teytaud. Adding expert knowledge and exploration in Monte-Carlo Tree Search. Advances in Computer Games, 2009, Pamplona, Spain. ⟨inria-00386477⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X EC-PARIS CNRS INRIA LIX X-LIX X-DEP-INFO UMR8623 INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY

294 Consultations

946 Téléchargements

Adding expert knowledge and exploration in Monte-Carlo Tree Search

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager