Simulation-based search of combinatorial games

Lukasz Lew, Rémi Coulom
SEQUEL - Sequential Learning
LIFL - Laboratoire d'Informatique Fondamentale de Lille, LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal, Inria Lille - Nord Europe
Abstract: Monte-Carlo Tree Search is a very successful game-playing algorithm. Unfortunately, it suffers from the horizon effect: some important tactical sequences may be delayed beyond the depth of the search tree, causing evaluation errors. Temporal-difference search with function approximation is a method that was proposed to overcome these weaknesses by adaptively changing the simulation policy outside the tree. In this paper we present experimental evidence demonstrating that temporal-difference search may fail to find an optimal policy, even in very simple game positions. Classical temporal-difference algorithms try to evaluate a local situation with a numerical value, but, as it turns out, a single number is not enough to model the dynamics of a partial two-player game state. As a solution, we propose to replace numerical values by approximate thermographs. With this richer representation of partial states, reinforcement-learning algorithms converge and accurately represent the dynamics of states, making it possible to find an optimal policy.
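For context on the limitation the abstract describes: a classical temporal-difference algorithm summarizes each state by a single scalar value. The sketch below shows a minimal TD(0) table update over hypothetical state names (not taken from the paper); the paper's proposed remedy, approximate thermographs, is not implemented here.

```python
# Minimal TD(0) update: each state is summarized by ONE number.
# The paper argues that such a scalar is too coarse to capture the
# dynamics of a partial two-player game state.

def td0_update(values, state, next_state, reward, alpha=0.1, gamma=1.0):
    """In-place TD(0) update of a state-value table (dict of floats)."""
    v_s = values.get(state, 0.0)
    v_next = values.get(next_state, 0.0)
    values[state] = v_s + alpha * (reward + gamma * v_next - v_s)
    return values[state]

# Toy episode over hypothetical states "a" -> "b" -> terminal.
values = {}
td0_update(values, "b", "terminal", reward=1.0)  # bootstraps from V(terminal)=0
td0_update(values, "a", "b", reward=0.0)         # bootstraps from updated V(b)
```

In this scalar scheme, V("a") and V("b") each collapse to one number; the paper's point is that for combinatorial-game positions this collapse loses information needed to find an optimal policy.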
Document type:
Conference paper
ICML 2010 : Workshop on Machine Learning and Games, Jun 2010, Haifa, Israel. 2010, 〈http://www-kd.iai.uni-bonn.de/icml2010mlg/program.html〉

https://hal.inria.fr/hal-00694030
Contributor: Ist Rennes
Submitted on: Thursday, May 3, 2012 - 14:00:03
Last modified on: Thursday, January 11, 2018 - 01:49:33

Identifiers

  • HAL Id : hal-00694030, version 1

Citation

Lukasz Lew, Rémi Coulom. Simulation-based search of combinatorial games. ICML 2010 : Workshop on Machine Learning and Games, Jun 2010, Haifa, Israel. 2010, 〈http://www-kd.iai.uni-bonn.de/icml2010mlg/program.html〉. 〈hal-00694030〉
