P. Alquier and B. Guedj, Simpler PAC-Bayesian bounds for hostile data, Machine Learning, vol.107, pp.887-902, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01385064

J. Audibert, PAC-Bayesian statistical learning theory, vol.6, p.29, 2004.

J. Audibert, R. Munos, and C. Szepesvári, Tuning bandit algorithms in stochastic environments, International conference on algorithmic learning theory, pp.150-165, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00203487

L. Peter, M. I. Bartlett, J. D. Jordan, and . Mcauliffe, Convexity, classification, and risk bounds, Journal of the American Statistical Association, vol.101, issue.473, pp.138-156, 2006.

L. Peter, S. Bartlett, and . Mendelson, Empirical minimization. Probability Theory and Related Fields, vol.135, pp.311-334, 2006.

O. Bousquet and A. Elisseeff, Stability and generalization, Journal of machine learning research, vol.2, pp.499-526, 2002.

O. Catoni, A PAC-Bayesian approach to adaptive classification, 2003.

O. Catoni, PAC-Bayesian Supervised Classification, Lecture Notes-Monograph Series. IMS, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

O. Catoni, PAC-Bayesian supervised classification: the thermodynamics of statistical learning, Lecture Notes-Monograph Series. IMS, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

N. Cesa-bianchi, Y. Mansour, and G. Stoltz, Improved second-order bounds for prediction with expert advice, Machine Learning, vol.66, pp.321-352, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00019799

N. Cesa, -. Bianchi, and G. Lugosi, Prediction, Learning and Games, 2006.

D. Dua and C. Graff, UCI machine learning repository, 2017.

K. Gintare, D. M. Dziugaite, and . Roy, Computing nonvacuous generalization bounds for deep (stochastic) neural networks with many more parameters than training data, UAI, 2017.

T. Van-erven, N. A. Mehta, M. D. Reid, and R. C. Williamson, Fast rates in statistical and online learning, Journal of Machine Learning Research, vol.16, pp.1793-1861, 2015.

X. Fan, I. Grama, and Q. Liu, Exponential inequalities for martingales with applications, Electronic Journal of Probability, vol.20, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01108032

P. Germain, A. Lacasse, F. Laviolette, and M. Marchand, PAC-Bayesian learning of linear classifiers, Proceedings of the 26th Annual International Conference on Machine Learning, pp.353-360, 2009.

P. Germain, A. Lacasse, F. Laviolette, M. Marchand, and J. Roy, Risk bounds for the majority vote: From a pac-bayesian analysis to a learning algorithm, The Journal of Machine Learning Research, vol.16, issue.1, pp.787-860, 2015.

D. Peter, N. A. Grünwald, and . Mehta, Fast rates for general unbounded loss functions: from ERM to generalized Bayes, Journal of Machine Learning Research, 2019.

B. Guedj, A primer on PAC-Bayesian learning, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01983732

A. Steven-r-howard, J. Ramdas, J. Mcauliffe, and . Sekhon, Uniform, nonparametric, non-asymptotic confidence sequences, 2018.

M. Wouter, P. D. Koolen, T. Grünwald, and . Van-erven, Combining adversarial guarantees and stochastic fast rates in online learning, Advances in Neural Information Processing Systems, pp.4457-4465, 2016.

J. Langford and R. Caruana, Not) bounding the true error, Advances in Neural Information Processing Systems, vol.14, pp.809-816, 2002.

J. Langford and J. Shawe-taylor, PAC-Bayes & margins, Advances in Neural Information Processing Systems, pp.439-446, 2003.

P. Massart and É. Nédélec, Risk bounds for statistical learning. The Annals of Statistics, vol.34, pp.2326-2366, 2006.

A. Maurer, A note on the PAC-Bayesian theorem, 2004.

A. Maurer and M. Pontil, Empirical Bernstein bounds and sample variance penalization, Proceedings COLT 2009, 2009.

D. A. Mcallester, Some PAC-Bayesian theorems, Proceedings of the Eleventh ACM Conference on Computational Learning Theory (COLT' 98), pp.230-234, 1998.

D. A. Mcallester, PAC-Bayesian model averaging, Proceedings of the Twelfth ACM Conference on Computational Learning Theory (COLT' 99), pp.164-171, 1999.

D. A. Mcallester, PAC-Bayesian stochastic model selection, Machine Learning, vol.51, pp.5-21, 2003.

A. Nishant and . Mehta, Fast rates with high probability in exp-concave statistical learning, Artificial Intelligence and Statistics, pp.1085-1093, 2017.

V. Mnih, C. Szepesvári, and J. Audibert, Empirical bernstein stopping, Proceedings of the 25th international conference on Machine learning, pp.672-679, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00834983

S. Mukherjee, P. Niyogi, T. Poggio, and R. Rifkin, Learning theory: stability is sufficient for generalization and necessary and sufficient for consistency of empirical risk minimization, Advances in Computational Mathematics, vol.25, issue.1-3, pp.161-193, 2006.

S. Behnam-neyshabur, D. A. Bhojanapalli, N. Mcallester, and . Srebro, A PAC-Bayesian approach to spectrally-normalized margin bounds for neural networks, In ICLR, 2018.

O. Rivasplata, C. Szepesvári, J. S. Shawe-taylor, E. Parrado-hernandez, and S. Sun, Pac-bayes bounds for stable algorithms with instance-dependent priors, Advances in Neural Information Processing Systems, pp.9214-9224, 2018.

M. Seeger, PAC-Bayesian generalisation error bounds for Gaussian process classification, Journal of machine learning research, vol.3, pp.233-269, 2002.

M. Seeger, PAC-Bayesian generalization error bounds for Gaussian process classification, Journal of Machine Learning Research, vol.3, pp.233-269, 2002.

S. Shalev-shwartz, O. Shamir, N. Srebro, and K. Sridharan, Learnability, stability and uniform convergence, Journal of Machine Learning Research, vol.11, pp.2635-2670, 2010.

O. Ilya, Y. Tolstikhin, and . Seldin, PAC-Bayes-empirical-Bernstein inequality, Advances in Neural Information Processing Systems, pp.109-117, 2013.

A. B. Tsybakov, Optimal aggregation of classifiers in statistical learning, The Annals of Statistics, vol.32, issue.1, pp.135-166, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00102142

O. Wintenberger, Optimal learning with bernstein online aggregation, Machine Learning, vol.106, pp.119-141, 2017.
URL : https://hal.archives-ouvertes.fr/hal-00973918

W. Zhou, V. Veitch, M. Austern, R. P. Adams, and P. Orbanz, Nonvacuous generalization bounds at the ImageNet scale: a PAC-Bayesian compression approach, ICLR, 2019.