Mohex wins hex tournament, ICGA journal, pp.114-116, 2009. ,
Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.2/3, pp.235-256, 2002. ,
DOI : 10.1023/A:1013689704352
Dynamic Programming, 1957. ,
Continuous Upper Confidence Trees, LION'11: Proceedings of the 5th International Conference on Learning and Intelligent OptimizatioN, p.page TBA, 2011. ,
DOI : 10.1016/0196-8858(85)90002-8
URL : https://hal.archives-ouvertes.fr/hal-00835352
Computing elo ratings of move patterns in the game of go, Computer Games Workshop, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00149859
Bandit-based optimization on graphs with application to library performance tuning, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.729-736, 2009. ,
DOI : 10.1145/1553374.1553468
URL : https://hal.archives-ouvertes.fr/inria-00379523
Combining online and offline knowledge in UCT, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.273-280, 2007. ,
DOI : 10.1145/1273496.1273531
URL : https://hal.archives-ouvertes.fr/inria-00164003
Bandit Based Monte-Carlo Planning, 15th European Conference on Machine Learning (ECML), pp.282-293, 2006. ,
DOI : 10.1007/11871842_29
The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments, IEEE Transactions on Computational Intelligence and AI in games, 2009. ,
Les Réserves et la Régulation de l'Avenir dans la vie Economique, 1946. ,
Monte-carlo exploration for deterministic planning, IJCAI, pp.1766-1771, 2009. ,
Optimal robust expensive optimization is tractable, Proceedings of the 11th Annual conference on Genetic and evolutionary computation, GECCO '09, 2009. ,
DOI : 10.1145/1569901.1570255
URL : https://hal.archives-ouvertes.fr/inria-00374910
Reinforcement learning, 1998. ,
DOI : 10.1007/978-1-4615-3618-5
URL : https://hal.archives-ouvertes.fr/hal-00764281
Creating an Upper-Confidence-Tree Program for Havannah, ACG 12, 2009. ,
DOI : 10.1007/978-3-642-12993-3_7
URL : https://hal.archives-ouvertes.fr/inria-00380539
On the use of low discrepancy sequences in Monte Carlo methods, Monte Carlo Methods and Applications, vol.2, issue.4, 1996. ,
DOI : 10.1515/mcma.1996.2.4.295
Algorithms for infinitely manyarmed bandits, Advances in Neural Information Processing Systems, 2008. ,