Y. Lecun, K. Kavukcuoglu, and C. Farabet, Convolutional networks and applications in vision. In: International Symposium on Circuits and Systems, pp.253-256, 2010.

T. Williams and R. Li, Wavelet pooling for convolutional neural networks, International Conference on Learning Representations, 2018.

O. Rippel, J. Snoek, and R. P. Adams, Spectral representations for convolutional neural networks. In: Neural Information Processing Systems, pp.2449-2457, 2015.

Q. V. Le, Building high-level features using large scale unsupervised learning, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.8595-8598, 2013.
DOI : 10.1109/ICASSP.2013.6639343
URL : http://arxiv.org/pdf/1112.6209

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp.1097-1105, 2012.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016.
DOI : 10.1109/CVPR.2016.90

D. A. Forsyth and J. Ponce, Computer vision: a modern approach, 2002.
URL : https://hal.archives-ouvertes.fr/hal-01063327

S. Mallat, A wavelet tour of signal processing, 1999.

S. Mallat and W. L. Hwang, Singularity detection and processing with wavelets, IEEE Transactions on Information Theory, vol.38, issue.2, pp.617-643, 1992.
DOI : 10.1109/18.119727

A. Skodras, C. Christopoulos, and T. Ebrahimi, The JPEG 2000 still image compression standard, IEEE Signal Processing Magazine, vol.18, issue.5, pp.36-58, 2001.
DOI : 10.1109/79.952804

F. Perronnin and D. Larlus, Fisher vectors meet Neural Networks: A hybrid classification architecture, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3743-3752, 2015.
DOI : 10.1109/CVPR.2015.7298998

S. Fujieda, K. Takayama, and T. Hachisuka, Wavelet convolutional neural networks for texture classification, 2017.

G. Huang, Z. Liu, K. Q. Weinberger, and L. Van-der-maaten, Densely Connected Convolutional Networks, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
DOI : 10.1109/CVPR.2017.243
URL : http://arxiv.org/pdf/1608.06993

A. Levinskis, Convolutional Neural Network Feature Reduction using Wavelet Transform, Electronics and Electrical Engineering, vol.19, issue.3, pp.61-64, 2013.
DOI : 10.5755/j01.eee.19.3.3698

L. Gueguen, A. Sergeev, R. Liu, and J. Yosinski, Faster neural networks straight from JPEG. In: International Conference on Learning Representations Workshop, 2018.

E. Oyallon, E. Belilovsky, and S. Zagoruyko, Scaling the Scattering Transform: Deep Hybrid Networks, 2017 IEEE International Conference on Computer Vision (ICCV), 2017.
DOI : 10.1109/ICCV.2017.599
URL : https://hal.archives-ouvertes.fr/hal-01495734

J. Bruna and S. Mallat, Invariant Scattering Convolution Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.8, pp.1872-1886, 2013.
DOI : 10.1109/TPAMI.2012.230

J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek, Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013.
DOI : 10.1007/s11263-006-9794-4

J. M. Morel and G. Yu, ASIFT: A New Framework for Fully Affine Invariant Image Comparison, SIAM Journal on Imaging Sciences, vol.2, issue.2, pp.438-469, 2009.
DOI : 10.1137/080732730

B. A. Olshausen and D. J. Field, Emergence of simple-cell receptive field properties by learning a sparse code for natural images, Nature, vol.381, issue.6583, p.607, 1996.
DOI : 10.1038/381607a0

S. Mallat, Group Invariant Scattering, Communications on Pure and Applied Mathematics, vol.37, issue.10, pp.1331-1398, 2012.
DOI : 10.1137/S0036141002404838

S. Mallat and I. Waldspurger, Phase Retrieval for the Cauchy Wavelet Transform, Journal of Fourier Analysis and Applications, vol.10, issue.3, pp.1251-1309, 2015.
DOI : 10.1080/713817747
URL : https://hal.archives-ouvertes.fr/hal-01645090

I. Waldspurger, A. Aspremont, and S. Mallat, Phase recovery, MaxCut and complex semidefinite programming, Mathematical Programming, vol.16, issue.3, pp.47-81, 2015.
DOI : 10.1137/04061341X
URL : https://hal.archives-ouvertes.fr/hal-00907535

K. Krajsek and R. Mester, A Unified Theory for Steerable and Quadrature Filters, In: Advances in Computer Graphics and Computer Vision, vol.10, issue.2, pp.201-214, 2007.
DOI : 10.1109/83.902274

R. Soulard, Ondelettes analytiques et monogènes pour la représentation des images couleur, 2012.

N. Delprat, B. Escudié, P. Guillemain, R. Kronland-martinet, P. Tchamitchian et al., Asymptotic wavelet and Gabor analysis: extraction of instantaneous frequencies, IEEE Transactions on Information Theory, vol.38, issue.2, pp.644-664, 1992.
DOI : 10.1109/18.119728
URL : https://hal.archives-ouvertes.fr/hal-01222729

J. Bruna and S. Mallat, Audio texture synthesis with scattering moments, 2013.

J. Bruna, Scattering representations for recognition, 2013.
URL : https://hal.archives-ouvertes.fr/pastel-00905109

T. Lindeberg, Feature detection with automatic scale selection, International Journal of Computer Vision, vol.30, issue.2, pp.79-116, 1998.
DOI : 10.1023/A:1008045108935

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

S. Zagoruyko and N. Komodakis, Wide residual networks. In: British Machine Vision Conference, 2016.

R. Torfason, F. Mentzer, E. Agustsson, M. Tschannen, R. Timofte et al., Towards image understanding from deep compression without decoding, 2018.

J. Yang, J. Lu, D. Batra, and D. Parikh, A faster pytorch implementation of faster R-CNN. https, 2017.

R. B. Girshick, Fast R-CNN. International Conference on Computer Vision, pp.1440-1448, 2015.

S. Ren, K. He, R. Girshick, and J. Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.6, pp.1137-1149, 2017.
DOI : 10.1109/TPAMI.2016.2577031

S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, 2015.

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, pp.303-338, 2010.
DOI : 10.1371/journal.pcbi.0040027

A. Paszke, S. Gross, S. Chintala, G. Chanan, E. Yang et al., Automatic differentiation in pytorch, 2017.

K. He, G. Gkioxari, P. Dollár, and R. Girshick, Mask R-CNN. In: International Conference Computer Vision, pp.2980-2988, 2017.

T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona et al., Microsoft COCO: Common objects in context. In: European Conference on Computer Vision, pp.740-755, 2014.