Minimax policies for adversarial and stochastic bandits, Conference on Learning Theory, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00834882
Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, pp.235-256, 2002. ,
The nonstochastic multi-armed bandit problem, Journal on Computing, vol.32, issue.1, pp.48-77, 2002. ,