N. Alon, N. Cesa-bianchi, C. Gentile, and Y. Mansour, From Bandits to Experts: A Tale of Domination and Independence, Neural Information Processing Systems, 2013.

J. Y. Audibert, S. Bubeck, and G. Lugosi, Regret in Online Combinatorial Optimization, Mathematics of Operations Research, vol.39, issue.1, pp.31-45, 2014.
DOI : 10.1287/moor.2013.0598

P. Auer, N. Cesa-bianchi, Y. Freund, and R. E. Schapire, The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, vol.32, issue.1, pp.48-77, 2002.
DOI : 10.1137/S0097539701398375

P. Auer, N. Cesa-bianchi, and C. Gentile, Adaptive and Self-Confident On-Line Learning Algorithms, Journal of Computer and System Sciences, vol.64, issue.1, pp.48-75, 2002.
DOI : 10.1006/jcss.2001.1795

N. Cesa-bianchi, Y. Freund, D. Haussler, D. Helmbold, R. Schapire et al., How to use expert advice, Journal of the ACM, vol.44, issue.3, pp.427-485, 1997.
DOI : 10.1145/258128.258179

N. Cesa-bianchi and G. Lugosi, Combinatorial bandits, Journal of Computer and System Sciences, vol.78, issue.5, pp.1404-1422, 2012.
DOI : 10.1016/j.jcss.2012.01.001

W. Chen, Y. Wang, and Y. Yuan, Combinatorial Multi-Armed Bandit: General Framework and Applications, International Conference on Machine Learning, pp.151-159, 2013.

L. Györfi and B. Ottucsák, Sequential Prediction of Unbounded Stationary Time Series, IEEE Transactions on Information Theory, vol.53, issue.5, pp.866-1872, 2007.
DOI : 10.1109/TIT.2007.894660

J. Hannan, Approximation to Bayes Risk in Repeated Play. Contributions to the theory of games, pp.97-139, 1957.

M. Hutter and J. Poland, Prediction with Expert Advice by Following the Perturbed Leader for General Weights, Algorithmic Learning Theory, pp.279-293, 2004.
DOI : 10.1007/978-3-540-30215-5_22

A. Kalai and S. Vempala, Efficient algorithms for online decision problems, Journal of Computer and System Sciences, vol.71, issue.3, pp.291-307, 2005.
DOI : 10.1016/j.jcss.2004.10.016

W. M. Koolen, M. K. Warmuth, and J. And-kivinen, Hedging structured concepts, Proceedings of the 23rd Annual Conference on Learning Theory (COLT), pp.93-105, 2010.

N. Littlestone and M. Warmuth, The Weighted Majority Algorithm, Information and Computation, vol.108, issue.2, pp.212-261, 1994.
DOI : 10.1006/inco.1994.1009

S. Mannor and O. Shamir, From Bandits to Experts: On the Value of Side- Observations, Neural Information Processing Systems, 2011.

G. Neu and G. Bartók, An Efficient Algorithm for Learning with Semi-bandit Feedback, Algorithmic Learning Theory, pp.234-248, 2013.
DOI : 10.1007/978-3-642-40935-6_17

V. Vovk, AGGREGATING STRATEGIES, Proceedings of the third annual workshop on Computational learning theory (COLT), pp.371-386, 1990.
DOI : 10.1016/B978-1-55860-146-8.50032-1