Minimax policies for adversarial and stochastic bandits, Conference on Learning Theory, 2009.
URL: https://hal.archives-ouvertes.fr/hal-00834882
Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol. 47, pp. 235-256, 2002.
The nonstochastic multi-armed bandit problem, SIAM Journal on Computing, vol. 32, no. 1, pp. 48-77, 2002.
Stochastic multi-armed bandit problem with non-stationary rewards, Neural Information Processing Systems, 2014.
Learning from time-changing data with adaptive windowing, International Conference on Data Mining, 2007.
Multi-armed bandit problem with known trend, Neurocomputing, vol. 205, pp. 16-21, 2016.