Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.23, pp.235-256, 2002. ,
Gambling in a rigged casino: the adversarial multi-armed bandit problem, Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pp.322-331, 1995. ,
Logarithmic online regret bounds for undiscounted reinforcement learning, Advances in Neural Information Processing Systems 19, 2007. ,
Prediction, learning, and games, 2006. ,
An improved data stream summary: the count-min sketch and its applications, J. Algorithms, vol.55, issue.1, pp.58-75, 2005. ,
Multihypothesis sequential probability ratio tests: accurate asymptotic expansions for the expected sample size, 1999. ,
User profiling for interestfocused browsing history, Proceedings of UserSWeb05, 2005. ,
Optimal and asymptotically optimal cusum rules for change point detection in the brownian motion model with multiple alternatives, Theory of Probability and its Applications, pp.131-144, 2006. ,
Detecting change in data streams, Proc. VLDB'04, pp.180-191, 2004. ,
Reduced-variance payoff estimation in adversarial bandit problems, Proceedings of the ECML-2005 Workshop on Reinforcement Learning in Non-Stationary Environments, 2005. ,
Discounted-UCB, 2nd Pascal-Challenge Workshop, 2006. ,
Test of Page-Hinkley, an approach for fault detection in an agro-alimentary production system, 5th Asian Control Conference, pp.815-818, 2004. ,
Détection supervisée d'´ evénementsevénements`evénementsà l'aide d'une modélisation probabiliste du mouvement perçu, 14e Congrès Francophone AFRIF-AFIA de Reconnaissance des Formes et Intelligence Artificielle, 2004. ,