B. Alipanahi, A. Delong, M. T. Weirauch, and B. J. Frey, Predicting the sequence specificities of dna-and rna-binding proteins by deep learning, Nature biotechnology, vol.33, issue.8, p.831, 2015.

S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang et al., Gapped blast and psi-blast: a new generation of protein database search programs, Nucleic acids research, vol.25, issue.17, pp.3389-3402, 1997.

M. Arbel, D. J. Sutherland, M. Bi?kowski, and A. Gretton, On gradient regularizers for MMD GANs, Advances in Neural Information Processing Systems (NeurIPS), 2018.

M. Arjovsky, S. Chintala, L. Bottou, and G. Wasserstein, Proceedings of the International Conference on Machine Learning (ICML), 2017.

P. L. Bartlett, D. J. Foster, and M. J. Telgarsky, Spectrally-normalized margin bounds for neural networks, Advances in Neural Information Processing Systems (NIPS), 2017.

M. Belkin, D. Hsu, and P. Mitra, Overfitting or perfect fitting? risk bounds for classification and regression rules that interpolate, Advances in Neural Information Processing Systems (NeurIPS), 2018.

M. Belkin, S. Ma, and S. Mandal, To understand deep learning we need to understand kernel learning, Proceedings of the International Conference on Machine Learning (ICML), 2018.

A. Bietti and J. Mairal, Group invariance, stability to deformations, and complexity of deep convolutional representations, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01536004

B. Biggio and F. Roli, Wild patterns: Ten years after the rise of adversarial machine learning, Pattern Recognition, vol.84, pp.317-331, 2018.

M. Bi?kowski, D. J. Sutherland, M. Arbel, A. Gretton, . Demystifying et al., Proceedings of the International Conference on Learning Representations (ICLR), 2018.

S. Boucheron, O. Bousquet, and G. Lugosi, Theory of classification: A survey of some recent advances, ESAIM: probability and statistics, vol.9, pp.323-375, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00017923

O. Chapelle, B. Schölkopf, and A. Zien, Semi-Supervised Learning, 2006.

T. Ching, Opportunities and obstacles for deep learning in biology and medicine, Journal of The Royal Society Interface, vol.15, issue.141, 2018.

M. Cisse, P. Bojanowski, E. Grave, Y. Dauphin, and N. Usunier, Parseval networks: Improving robustness to adversarial examples, Proceedings of the International Conference on Machine Learning (ICML), 2017.

H. Drucker and Y. Le-cun, Double backpropagation increasing generalization performance, International Joint Conference on Neural Networks (IJCNN), 1991.
DOI : 10.1109/ijcnn.1991.155328

G. K. Dziugaite, D. M. Roy, and Z. Ghahramani, Training generative neural networks via maximum mean discrepancy optimization, Conference on Uncertainty in Artificial Intelligence (UAI), 2015.

L. Engstrom, D. Tsipras, L. Schmidt, and A. Madry, A rotation and a translation suffice: Fooling cnns with simple transformations, 2017.

A. Gretton, K. M. Borgwardt, M. J. Rasch, B. Schölkopf, and A. Smola, A kernel two-sample test, Journal of Machine Learning Research, vol.13, pp.723-773, 2012.

I. Gulrajani, F. Ahmed, M. Arjovsky, V. Dumoulin, and A. C. Courville, Improved training of Wasserstein GANs, Advances in Neural Information Processing Systems (NIPS), 2017.

T. Håndstad, A. J. Hestnes, and P. Saetrom, Motif kernel generated by genetic programming improves remote homology and fold detection, BMC bioinformatics, vol.8, issue.1, p.23, 2007.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

S. M. Kakade, K. Sridharan, and A. Tewari, On the complexity of linear prediction: Risk bounds, margin bounds, and regularization, Advances in Neural Information Processing Systems (NIPS), 2009.

V. Koltchinskii and D. Panchenko, Empirical margin distributions and bounding the generalization error of combined classifiers. The Annals of Statistics, vol.30, pp.1-50, 2002.

C. Li, W. Chang, Y. Cheng, Y. Yang, and B. Póczos, Mmd gan: Towards deeper understanding of moment matching network, Advances in Neural Information Processing Systems (NIPS), 2017.

G. Loosli, S. Canu, and L. Bottou, Training invariant support vector machines using selective sampling, Large Scale Kernel Machines, pp.301-320, 2007.

C. Lyu, K. Huang, and H. Liang, A unified gradient regularization family for adversarial examples, IEEE International Conference on Data Mining (ICDM), 2015.

A. Madry, A. Makelov, L. Schmidt, D. Tsipras, and A. Vladu, Towards deep learning models resistant to adversarial attacks, Proceedings of the International Conference on Learning Representations (ICLR), 2018.

J. , End-to-end kernel learning with supervised convolutional kernel networks, Advances in Neural Information Processing Systems (NIPS), 2016.

T. Miyato, T. Kataoka, M. Koyama, and Y. Yoshida, Spectral normalization for generative adversarial networks, Proceedings of the International Conference on Learning Representations (ICLR), 2018.

T. Miyato, S. Maeda, S. Ishii, and M. Koyama, Virtual adversarial training: a regularization method for supervised and semi-supervised learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018.

A. G. Murzin, S. E. Brenner, T. Hubbard, and C. Chothia, Scop: a structural classification of proteins database for the investigation of sequences and structures, Journal of molecular biology, vol.247, issue.4, pp.536-540, 1995.

B. Neyshabur, S. Bhojanapalli, D. Mcallester, and N. Srebro, A PAC-Bayesian approach to spectrallynormalized margin bounds for neural networks, Proceedings of the International Conference on Learning Representations (ICLR), 2018.

E. Oyallon, E. Belilovsky, and S. Zagoruyko, Scaling the scattering transform: Deep hybrid networks, International Conference on Computer Vision (ICCV, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01495734

A. Raghunathan, J. Steinhardt, and P. Liang, Certified defenses against adversarial examples, Proceedings of the International Conference on Learning Representations (ICLR), 2018.

K. Roth, A. Lucchi, S. Nowozin, and T. Hofmann, Stabilizing training of generative adversarial networks through regularization, Advances in Neural Information Processing Systems (NIPS), 2017.

K. Roth, A. Lucchi, S. Nowozin, and T. Hofmann, Adversarially robust training through structured gradient regularization, 2018.

L. Schmidt, S. Santurkar, D. Tsipras, K. Talwar, and A. M?dry, Adversarially robust generalization requires more data, Advances in Neural Information Processing Systems (NeurIPS), 2018.

B. Schölkopf and A. J. Smola, Learning with kernels: support vector machines, regularization, optimization, and beyond, 2001.

H. Sedghi, V. Gupta, and P. M. Long, The singular values of convolutional layers, 2018.

P. Y. Simard, Y. A. Lecun, J. S. Denker, and B. Victorri, Transformation invariance in pattern recognitiontangent distance and tangent propagation, Neural networks: tricks of the trade, pp.239-274, 1998.

C. Simon-gabriel, Y. Ollivier, B. Schölkopf, L. Bottou, and D. Lopez-paz, Adversarial vulnerability of neural networks increases with input dimension, 2018.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, Proceedings of the International Conference on Learning Representations (ICLR), 2014.

A. Sinha, H. Namkoong, and J. Duchi, Certifying some distributional robustness with principled adversarial training, Proceedings of the International Conference on Learning Representations (ICLR), 2018.

B. K. Sriperumbudur, K. Fukumizu, A. Gretton, B. Schölkopf, and G. R. Lanckriet, On the empirical estimation of integral probability metrics, Electronic Journal of Statistics, vol.6, pp.1550-1599, 2012.

C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan et al., Intriguing properties of neural networks, 2013.

S. Thrun, Lifelong learning algorithms, Learning to learn, pp.181-209, 1998.

D. Tsipras, S. Santurkar, L. Engstrom, A. Turner, and A. Madry, There is no free lunch in adversarial robustness, 2018.

E. Wong and J. Z. Kolter, Provable defenses against adversarial examples via the convex outer adversarial polytope, Proceedings of the International Conference on Machine Learning (ICML), 2018.

H. Xu, C. Caramanis, and S. Mannor, Robust regression and lasso, Advances in Neural Information Processing Systems (NIPS), 2009.

H. Xu, C. Caramanis, and S. Mannor, Robustness and regularization of support vector machines, Journal of Machine Learning Research (JMLR), vol.10, pp.1485-1510, 2009.

Y. Yoshida and T. Miyato, Spectral norm regularization for improving the generalizability of deep learning, 2017.

Y. Zhang, J. D. Lee, and M. I. Jordan, 1-regularized neural networks are improperly learnable in polynomial time, Proceedings of the International Conference on Machine Learning (ICML), 2016.

Y. Zhang, P. Liang, and M. J. Wainwright, Convexified convolutional neural networks, Proceedings of the International Conference on Machine Learning (ICML), 2017.