A. P. Cesa-bianchi-n and . Fischer-p, Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.23, pp.235-256, 2002.

A. P. Cesa-bianchi-n, . Freund-y, and . E. Schapire-r, Gambling in a rigged casino: the adversarial multi-armed bandit problem, Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pp.322-331, 1995.

A. P. Ortner-r, Logarithmic online regret bounds for undiscounted reinforcement learning, Advances in Neural Information Processing Systems 19, 2007.

C. Lugosi-g, Prediction, learning, and games, 2006.

C. G. Muthukrishnan-s, An improved data stream summary: the count-min sketch and its applications, J. Algorithms, vol.55, issue.1, pp.58-75, 2005.

D. V. and T. A. Veeravalli-v, Multihypothesis sequential probability ratio tests: accurate asymptotic expansions for the expected sample size, 1999.

G. M. and M. D. Grobelnik-m, User profiling for interestfocused browsing history, Proceedings of UserSWeb05, 2005.

H. O. Moustakides-g, Optimal and asymptotically optimal cusum rules for change point detection in the brownian motion model with multiple alternatives, Theory of Probability and its Applications, pp.131-144, 2006.

K. D. Ben- and D. S. Gehrke-j, Detecting change in data streams, Proc. VLDB'04, pp.180-191, 2004.

K. L. Szepesvari-c, Reduced-variance payoff estimation in adversarial bandit problems, Proceedings of the ECML-2005 Workshop on Reinforcement Learning in Non-Stationary Environments, 2005.

K. L. Szepesvari-c, Discounted-UCB, 2nd Pascal-Challenge Workshop, 2006.

M. H. , M. D. , and M. N. Sefouhi-l, Test of Page-Hinkley, an approach for fault detection in an agro-alimentary production system, 5th Asian Control Conference, pp.815-818, 2004.

P. G. , C. F. Bouthemy-p, and . Yao-j.-f, Détection supervisée d'´ evénementsevénements`evénementsà l'aide d'une modélisation probabiliste du mouvement perçu, 14e Congrès Francophone AFRIF-AFIA de Reconnaissance des Formes et Intelligence Artificielle, 2004.