S. Agrawal and N. Goyal, Analysis of Thompson sampling for the multi-armed bandit problem, Proceedings of the 25th Conference on Learning Theory (CoLT), pp.1-26, 2012.

J.-Y. Audibert and S. Bubeck, Best-arm identification in multi-armed bandits, Proceedings of the 23rd Conference on Learning Theory (CoLT), 2010.
URL : https://hal.archives-ouvertes.fr/hal-00654404

P. Auer, N. Cesa-Bianchi, and P. Fischer, Finite-time analysis of the multi-armed bandit problem, Machine Learning Journal, vol.47, issue.2-3, pp.235-256, 2002.

M. Aziz, J. Anderton, E. Kaufmann, and J. Aslam, Pure exploration in infinitely-armed bandit models with fixed-confidence, Proceedings of the 29th International Conference on Algorithmic Learning Theory (ALT), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01729969

M. Aziz, K. Jamieson, and J. Aslam, Pure-exploration for infinite-armed bandits with general arm reservoirs, 2018.

P. L. Bartlett, V. Gabillon, and M. Valko, A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption, Proceedings of the 30th International Conference on Algorithmic Learning Theory (ALT), 2019.

J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl, Algorithms for hyperparameter optimization, Advances in Neural Information Processing Systems 24 (NIPS), pp.2546-2554, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00642998

D. A. Berry, R. W. Chen, A. Zame, D. C. Heath, and L. A. Shepp, Bandit problems with infinitely many arms, Annals of Statistics, vol.25, issue.5, pp.2103-2116, 1997.

S. Bubeck, R. Munos, and G. Stoltz, Pure exploration in multi-armed bandits problems, Proceedings of the 20th International Conference on Algorithmic Learning Theory (ALT), pp.23-37, 2009.

S. Bubeck, R. Munos, G. Stoltz, and C. Szepesvári, X-armed bandits, Journal of Machine Learning Research, vol.12, pp.1587-1627, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00450235

A. Carpentier and M. Valko, Simple regret for infinitely many armed bandits, Proceedings of the 32nd International Conference on Machine Learning (ICML), pp.1133-1141, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01153538

D. Dua and E. Karra Taniskidou, UCI machine learning repository, 2017.

E. Even-Dar, S. Mannor, and Y. Mansour, Action elimination and stopping conditions for reinforcement learning, Proceedings of the 20th International Conference on Machine Learning (ICML), pp.162-169, 2003.

S. Falkner, A. Klein, and F. Hutter, BOHB: Robust and efficient hyperparameter optimization at scale, Proceedings of the 35th International Conference on Machine Learning (ICML), 2018.

V. Gabillon, M. Ghavamzadeh, and A. Lazaric, Best-arm identification: A unified approach to fixed budget and fixed confidence, Advances in Neural Information Processing Systems 25 (NIPS), pp.3212-3220, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00772615

J. Grill, M. Valko, and R. Munos, Black-box optimization of noisy functions with unknown smoothness, Advances in Neural Information Processing Systems 28 (NIPS), pp.667-675, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01222915

M. W. Hoffman, B. Shahriari, and N. de Freitas, On correlation and budget constraints in model-based bandit optimization with application to automatic machine learning, Proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AIStats), pp.365-374, 2014.

F. Hutter, H. H. Hoos, and K. Leyton-Brown, Sequential model-based optimization for general algorithm configuration, Proceedings of the 5th International Conference on Learning and Intelligent Optimization (LION), pp.507-523, 2011.

D. R. Jones, M. Schonlau, and W. J. Welch, Efficient global optimization of expensive black-box functions, Journal of Global Optimization, vol.13, issue.4, pp.455-492, 1998.

Z. Karnin, T. Koren, and O. Somekh, Almost optimal exploration in multiarmed bandits, Proceedings of the 30th International Conference on Machine Learning (ICML), pp.1238-1246, 2013.

A. Klein, S. Falkner, N. Mansur, and F. Hutter, RoBO: A flexible and robust Bayesian optimization framework in Python, 7th Workshop on Bayesian Optimization at Neural Information Processing Systems, 2017.

N. Knudde, J. van der Herten, T. Dhaene, and I. Couckuyt, GPflowOpt: A Bayesian optimization library using TensorFlow, 2017.

T. Lattimore and C. Szepesvári, Bandit algorithms, 2019.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998.

L. Li, K. Jamieson, G. DeSalvo, A. Talwalkar, and A. Rostamizadeh, Hyperband: Bandit-based configuration evaluation for hyperparameter optimization, Proceedings of the 5th International Conference on Learning Representations (ICLR), 2017.