Nash Reweighting of Monte Carlo Simulations: Tsumego - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Nash Reweighting of Monte Carlo Simulations: Tsumego

Résumé

Monte Carlo simulations are widely accepted as a tool for evaluating positions in games. It can be used inside tree search algorithms, simple Monte Carlo search, Nested Monte Carlo and the famous Monte Carlo Tree Search algorithm which is at the heart of the current revolution in computer games. If one has access to a perfect simulation policy, then there is no need for an estimation of the game value. In any other cases, an evaluation through Monte Carlo simulations is a possible approach. However, games simulations are, in practice, biased. Many papers are devoted to improve Monte Carlo simulation policies by reducing this bias. In this paper, we propose a complementary tool: instead of modifying the simulations, we modify the way they are averaged by adjusting weights. We apply our method to MCTS for Tsumego solving. In particular, we improve Gnugo-MCTS without any online computational overhead.
Fichier non déposé

Dates et versions

hal-01245520 , version 1 (17-12-2015)

Identifiants

Citer

David L. St-Pierre, Jialin Liu, Olivier Teytaud. Nash Reweighting of Monte Carlo Simulations: Tsumego. 2015 IEEE Congress on Evolutionary Computation (IEEE CEC 2015), May 2015, Sendai, Japan. pp.1458 - 1465, ⟨10.1109/CEC.2015.7257060⟩. ⟨hal-01245520⟩
171 Consultations
1 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More