Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.23, pp.235-256, 2002. ,
Denumerable-armed bandits, Econometrica, vol.60, issue.5, pp.1071-96, 1992. ,
Bayesian generation and integration of k-nearest-neighbor patterns for 19x19 go, IEEE 2005 Symposium on Computational Intelligence in Games, pp.176-181, 2005. ,
Progressive strategies for monte-carlo tree search, Proceedings of the 10th Joint Conference on Information Sciences, pp.655-661, 2007. ,
Bandit algorithms for tree search, Proceedings of UAI'07, 2007. ,
Robbing the bandit : less regret in online geometric optimization against an adaptive adversary, SODA '06 : Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm, pp.937-943, 2006. ,
Combining online and offline knowledge in uct, ICML '07 : Proceedings of the 24th international conference on Machine learning, pp.273-280, 2007. ,
Bandit-based monte-carlo planning, ECML'06, pp.282-293, 2006. ,
Asymptotically efficient adaptive allocation rules, Advances in applied mathematics, vol.6, pp.4-22, 1985. ,
Svm and pattern-enriched common fate graphs for the game of go, Proceedings of ESANN 2005, pp.485-490, 2005. ,
Modifications of UCT and sequence-like simulations for Monte-Carlo Go, IEEE Symposium on Computational Intelligence and Games, pp.175-182, 2007. ,