I. The, VocR and AXES submissions to Trecvid 2014 Multimedia Event Detection Matthijs Douze, 2014.

P. Agrawal, J. Carreira, and J. Malik, Learning to See by Moving, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.13

Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid, Good Practice in Large-Scale Learning for Image Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.3, 2013.
DOI : 10.1109/TPAMI.2013.146
URL : https://hal.archives-ouvertes.fr/hal-00835810

Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid, Label-Embedding for Image Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.7, 2015.
DOI : 10.1109/TPAMI.2015.2487986
URL : https://hal.archives-ouvertes.fr/hal-01207145

G. An, The Effects of Adding Noise During Backpropagation Training on a Generalization Performance, Neural Computation, vol.22, issue.3, 1996.
DOI : 10.1162/neco.1989.1.4.425

R. Arandjelovic and A. Zisserman, All About VLAD, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.207

A. Babenko and V. Lempitsky, Aggregating deep convolutional features for image retrieval, International Conference on Computer Vision, 2015.

A. Babenko, A. Slesarev, A. Chigorin, and V. Lempitsky, Neural Codes for Image Retrieval, European Conference on Computer Vision, 2014.
DOI : 10.1007/978-3-319-10590-1_38
URL : http://arxiv.org/pdf/1404.1777

P. Baldi and P. J. Sadowski, Understanding dropout, Advances in Neural Information Processing Systems, 2013.

H. Bay, T. Tuytelaars, and L. Van-gool, SURF: Speeded up robust features, European Conference on Computer Vision, 2006.
DOI : 10.1007/11744023_32

Y. Bengio, Learning deep architectures for AI. Foundations and Trends in Machine Learning, 2009.
DOI : 10.1561/2200000006
URL : http://www.iro.umontreal.ca/~bengioy/papers/ftml.pdf

T. Binford and T. Levitt, Quasi-invariants : theory and exploitation, Proceedings of Darpa Image Understanding Workshop, 1993.

C. Bishop, Training with Noise is Equivalent to Tikhonov Regularization, Neural computation, 1995.
DOI : 10.1162/neco.1994.6.1.147

L. Bo, X. Ren, and D. Fox, Kernel descriptors for visual recognition, Advances in Neural Information Processing Systems, 2010.

L. Bottou, Stochastic Gradient Descent Tricks, Neural Networks: Tricks of the Trade, 2012.
DOI : 10.1137/1116025

L. Bottou and O. Bousquet, The tradeoffs of large-scale learning, Advances in Neural Information Processing Systems, 2007.

M. Brown, G. Hua, and S. Winder, Discriminative Learning of Local Image Descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, 2011.
DOI : 10.1109/TPAMI.2010.54
URL : http://opus.bath.ac.uk/26111/1/Brown_IEEE%2DTPAMI_2011_33_1_43.pdf

J. B. Burns, R. S. Weiss, and E. M. Riseman, View variation of point-set and line-segment features, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.15, issue.1, 1993.
DOI : 10.1109/34.184774

M. Calonder, V. Lepetit, C. Strecha, and F. , BRIEF: Binary Robust Independent Elementary Features, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15561-1_56
URL : http://cvlab.epfl.ch/publications/publications/2010/LepetitF10.pdf

T. Chan, K. Jia, S. Gao, J. Lu, Z. Zeng et al., Pcanet: A simple deep learning baseline for image classification? arXiv preprint, 2014.
DOI : 10.1109/tip.2015.2475625
URL : http://arxiv.org/pdf/1404.3606

K. Chatfield and A. Zisserman, VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval, Asian Conference on Computer Vision, 2012.
DOI : 10.1007/978-3-642-37444-9_34

M. Chen, Z. Xu, K. Weinberger, and F. Sha, Marginalized denoising autoencoders for domain adaptation, International Conference on Machine Learning, 2012.

S. Chopra, R. Hadsell, and Y. Lecun, Learning a Similarity Metric Discriminatively, with Application to Face Verification, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.202

R. G. Cinbis, J. Verbeek, and C. Schmid, Multi-fold MIL Training for Weakly Supervised Object Localization, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.309
URL : https://hal.archives-ouvertes.fr/hal-00975746

S. Clinchant, G. Csurka, F. Perronnin, and J. Renders, XRCE's participation to Imageval. ImageEval workshop at CVIR, 2007.

A. Coates and A. Y. Ng, Learning Feature Representations with K-Means, Neural Networks: Tricks of the Trade, 2012.
DOI : 10.1007/978-3-642-15555-0_11

F. Cucker and D. Zhou, Learning theory : an approximation theory viewpoint, Cambridge Monographs on Applied and Computational Mathematics, 2007.
DOI : 10.1017/CBO9780511618796

D. Decoste and M. Burl, Distortion-invariant recognition via jittered queries, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), 2000.
DOI : 10.1109/CVPR.2000.855893

D. Decoste and B. Schölkopf, Training invariant support vector machines, Machine Learning, 2002.

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A largescale hierarchical image database, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009.

J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang et al., DeCAF: A deep convolutional activation feature for generic visual recognition, ICML, 2014.

J. Dong and S. Soatto, Domain-size pooling in local descriptors: DSP-SIFT, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7299145
URL : http://arxiv.org/pdf/1412.8556

D. Erhan, P. Manzagol, Y. Bengio, S. Bengio, and P. Vincent, The difficulty of training deep architectures and the effect of unsupervised pre-training, Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS), 2009.

D. Erhan, Y. Bengio, A. Courville, P. Manzagol, P. Vincent et al., Why does unsupervised pre-training help deep learning? The Journal of Machine Learning Research, 2010.

M. A. Fischler and R. A. Elschlager, The Representation and Matching of Pictorial Structures, IEEE Transactions on Computers, vol.22, issue.1, 1973.
DOI : 10.1109/T-C.1973.223602

L. Florack, B. T. Haar-romeny, J. Koenderink, and M. Viergever, General Intensity Transformations and Second Order Invariants, Theory & Applications of Image Analysis: Selected Papers from the 7th Scandinavian Conference on Image Analysis, 1992.
DOI : 10.1142/9789812797896_0003

W. T. Freeman and E. H. Adelson, The design and use of steerable filters. Transactions on Pattern Analysis & Machine Intelligence, 1991.

E. Gavves, B. Fernando, C. G. Snoek, A. W. Smeulders, and T. Tuytelaars, Fine-Grained Categorization by Alignments, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.215

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.81
URL : http://arxiv.org/pdf/1311.2524

Y. Gong, L. Wang, R. Guo, and S. Lazebnik, Multi-scale Orderless Pooling of Deep Convolutional Activation Features, European Conference on Computer Vision, 2014.
DOI : 10.1007/978-3-319-10584-0_26
URL : http://arxiv.org/pdf/1403.1840

R. Goroshin, J. Bruna, J. Tompson, D. Eigen, and Y. Lecun, Unsupervised feature learning from temporal data, Advances in Neural Information Processing Systems, 2014.

R. Goroshin, M. Mathieu, and Y. Lecun, learning to linearize under uncertainty, Advances in Neural Information Processing Systems, 2015.

P. Gosselin, N. Murray, H. Jégou, and F. Perronnin, Revisiting the Fisher vector for fine-grained classification, Pattern Recognition Letters, vol.49, 2014.
DOI : 10.1016/j.patrec.2014.06.011
URL : https://hal.archives-ouvertes.fr/hal-01056223

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, 1988.
DOI : 10.5244/C.2.23

K. He, X. Zhang, S. Ren, and J. Sun, Deep Residual Learning for Image Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2016.90
URL : http://arxiv.org/pdf/1512.03385

G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint, 2012.

C. Huang, S. Zhu, and K. Yu, Large scale strongly supervised ensemble metric learning, with applications to face verification and retrieval, 2012.

D. Jayaraman and K. Grauman, Learning image representations equivariant to ego-motion, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015.
DOI : 10.1109/iccv.2015.166
URL : http://arxiv.org/pdf/1505.02206

H. Jégou and O. Chum, Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening, European Conference on Computer Vision, 2012.

H. Jégou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, European Conference on Computer Vision, 2008.
DOI : 10.1109/CVPR.2007.383150

H. Jégou, M. Douze, and C. Schmid, On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206609

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540039

H. Jégou, M. Douze, and C. Schmid, Product Quantization for Nearest Neighbor Search, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, 2011.
DOI : 10.1109/TPAMI.2010.57

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, 2012.
DOI : 10.1109/TPAMI.2011.235

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654889

W. Jiang, Y. Song, T. Leung, C. Rosenberg, J. Wang et al., Learning fine-grained image similarity with deep ranking, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014.

T. Kadir, A. Zisserman, and M. Brady, An Affine Invariant Salient Region Detector, Proceedings of the European Conference on Computer Vision, 2004.
DOI : 10.1007/978-3-540-24670-1_18

D. Keysers, T. Deselaers, C. Gollan, and H. Ney, Deformation Models for Image Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.8, 2007.
DOI : 10.1109/TPAMI.2007.1153
URL : http://www.iupr.org/~keysers/files/Keysers--Deformation-Models--TPAMI2007.pdf

J. Krause, M. Stark, J. Deng, and L. Fei-fei, 3d object representations for finegrained categorization, IEEE International Conference on Computer Vision Workshops, 2013.
DOI : 10.1109/iccvw.2013.77

A. Krizhevsky, I. Sutskever, and G. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, 2012.
DOI : 10.1162/neco.2009.10-08-881
URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

M. P. Kumar, B. Packer, and D. Koller, Self-paced learning for latent variable models, Advances in Neural Information Processing Systems, 2010.

C. H. Lampert, H. Nickisch, and S. Harmeling, Learning to detect unseen object classes by between-class attribute transfer, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206594

C. H. Lampert, H. Nickisch, and S. Harmeling, Attribute-Based Classification for Zero-Shot Visual Object Categorization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.3, 2013.
DOI : 10.1109/TPAMI.2013.140

Y. Lecun, B. Boser, J. Denker, D. Henderson, R. Howard et al., Handwritten digit recognition with a back-propagation network, Advances in Neural Information Processing Systems, 1989.

Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proc. of the IEEE, 1998.
DOI : 10.1109/5.726791

Y. Li, N. Snavely, and D. P. Huttenlocher, Location Recognition Using Prioritized Feature Matching, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15552-9_57
URL : http://www.cs.cornell.edu/%7Edph/papers/localization.pdf

T. Lin, A. Roychowdhury, and S. Maji, Bilinear CNN Models for Fine-Grained Visual Recognition, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.170

T. Lindeberg, Feature detection with automatic scale selection, International journal of computer vision, 1998.

J. Long, N. Zhang, and T. Darrell, Do Convnets learn correspondances?, Advances in Neural Information Processing Systems, 2014.

G. Loosli, S. Canu, and L. Bottou, Training invariant support vector machines using selective sampling, Large Scale Kernel Machines, 2007.

D. G. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999.
DOI : 10.1109/ICCV.1999.790410

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

L. Maaten, M. Chen, S. Tyree, and K. Q. Weinberger, Learning with marginalized corrupted features, Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp.410-418, 2013.

J. Mairal, F. Bach, and J. Ponce, Sparse Modeling for Image and Vision Processing. Foundations and Trends in Computer Graphics and Vision, 2014.
DOI : 10.1561/0600000058
URL : https://hal.archives-ouvertes.fr/hal-01081139

S. Maji, J. Kannala, E. Rahtu, M. Blaschko, and A. Vedaldi, Fine-grained visual classification of aircraft, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00842101

S. Mallat, A wavelet tour of signal processing, 2008.

J. Marín, D. Vázquez, D. Gerónimo, and A. López, Learning appearance in virtual scenarios for pedestrian detection, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540218

K. Mikolajczyk and C. Schmid, An Affine Invariant Interest Point Detector, European Conference on Computer Vision, 2002.
DOI : 10.1007/3-540-47969-4_9
URL : https://hal.archives-ouvertes.fr/inria-00548252

K. Mikolajczyk and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2
URL : https://hal.archives-ouvertes.fr/inria-00548554

K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005.
URL : https://hal.archives-ouvertes.fr/inria-00548529

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, 2005.
DOI : 10.1007/s11263-005-3848-x
URL : https://hal.archives-ouvertes.fr/inria-00548528

J. Y. Ng, F. Yang, and L. S. Davis, Exploiting local features from deep networks for image retrieval, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2015.
DOI : 10.1109/CVPRW.2015.7301272
URL : http://arxiv.org/pdf/1504.05133

D. Nister and H. Stewenius, Scalable Recognition with a Vocabulary Tree, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.264

P. Niyogi, F. Girosi, and T. Poggio, Incorporating prior information in machine learning by creating virtual examples, Proceedings of the IEEE, 1998.
DOI : 10.1109/5.726787

M. Paulin, J. Mairal, M. Douze, Z. Harchaoui, F. Perronnin et al., Convolutional Patch Representations for Image Retrieval: An Unsupervised Approach, International Journal of Computer Vision, vol.34, issue.3, 2016.
DOI : 10.1109/CVPR.2015.7298767
URL : https://hal.archives-ouvertes.fr/hal-01277109

M. Perd-'och, O. Chum, and J. Matas, Efficient representation of local geometry for large scale object retrieval, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009.

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin and D. Larlus, Fisher vectors meet Neural Networks: A hybrid classification architecture, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3743-3752, 2015.
DOI : 10.1109/CVPR.2015.7298998

F. Perronnin, J. Sánchez, and Y. Liu, Large-scale image categorization with explicit data embedding, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539914

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635
URL : http://www.di.ens.fr/willow/pdfs/philbin08.pdf

J. Philbin, M. Isard, J. Sivic, and A. Zisserman, Descriptor Learning for Efficient Retrieval, European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15558-1_49
URL : http://www.di.ens.fr/willow/pdfs/philbin10b.pdf

L. Pishchulin, A. Jain, C. Wojek, M. Andriluka, T. Thormählen et al., Learning people detection models from few training samples, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995574

A. S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson, CNN features off-theshelf: an astounding baseline for recognition, 2014.
DOI : 10.1109/cvprw.2014.131
URL : http://arxiv.org/pdf/1403.6382

J. Sánchez, F. Perronnin, and T. De-campos, Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, 2012.
DOI : 10.1016/j.patrec.2012.07.019

J. Sánchez, F. Perronnin, T. Mensink, and J. J. Verbeek, Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, 2013.
DOI : 10.1007/s11263-006-9794-4

B. Schölkopf and A. J. Smola, Learning with kernels: Support vector machines, regularization, optimization, and beyond, 2002.

S. Shalev-shwartz and N. Srebro, SVM optimization, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008.
DOI : 10.1145/1390156.1390273

J. Sietsma and R. Dow, Creating artificial neural networks that generalize, Neural Networks, vol.4, issue.1, 1991.
DOI : 10.1016/0893-6080(91)90033-2

J. Sietsma and R. J. Dow, Creating artificial neural networks that generalize, Neural Networks, vol.4, issue.1, 1991.
DOI : 10.1016/0893-6080(91)90033-2

P. Simard, Y. Lecun, and J. Denker, Efficient pattern recognition using a new transformation distance, Advances in Neural Information Processing Systems, 1992.

P. Simard, D. Steinkraus, and J. C. Platt, Best practices for convolutional neural networks applied to visual document analysis, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings., 2003.
DOI : 10.1109/ICDAR.2003.1227801

E. Simo-serra, E. Trulls, L. Ferraz, I. Kokkinos, and F. Moreno-noguer, Discriminative Learning of Deep Convolutional Feature Point Descriptors, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.22
URL : http://upcommons.upc.edu/bitstream/2117/84259/1/1694-Discriminative-Learning-of-Deep-Convolutional-Feature-Point-Descriptors%281%29.pdf

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

K. Simonyan, A. Vedaldi, and A. Zisserman, Learning Local Feature Descriptors Using Convex Optimisation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.8, 2014.
DOI : 10.1109/TPAMI.2014.2301163

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

K. Sohn and H. Lee, Learning invariant representations with local transformations, International Conference on Machine Learning, 2012.

R. Szeliski, Computer Vision: Algorithms and Applications, 2010.
DOI : 10.1007/978-1-84882-935-0

E. Tola, V. Lepetit, and P. Fua, DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.5, 2010.
DOI : 10.1109/TPAMI.2009.77
URL : http://cvlab.epfl.ch/publications/publications/2010/TolaLF10a.pdf

G. Tolias, Y. Avrithis, and H. Jégou, To Aggregate or Not to aggregate: Selective Match Kernels for Image Search, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.177
URL : https://hal.archives-ouvertes.fr/hal-00864684

G. Tolias, R. Sicre, and H. Jégou, Particular Object Retrieval with Integral Max- Pooling of CNN Activations, International Conference on Representation Learning, 2015.

T. Tuytelaars and K. Mikolajczyk, Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision, 2007.
DOI : 10.1561/0600000017
URL : http://epubs.surrey.ac.uk/726872/1/Tuytelaars-FGV-2008.pdf

T. Tuytelaars and K. Mikolajczyk, Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision, 2008.
DOI : 10.1561/0600000017
URL : http://epubs.surrey.ac.uk/726872/1/Tuytelaars-FGV-2008.pdf

V. N. Vapnik, Statistical Learning Theory, 1998.

A. Vedaldi and A. Zisserman, Efficient additive kernels via explicit feature maps, IEEE Transactions on Pattern Analysis and Machine Intelligence, p.2012
DOI : 10.1109/cvpr.2010.5539949
URL : http://eprints.pascal-network.org/archive/00006964/01/vedaldi10.pdf

P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol, Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008.
DOI : 10.1145/1390156.1390294

C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, The CUB-200-2011 Dataset, 2011.

L. Wan, M. Zeiler, S. Zhang, Y. Lecun, and R. Fergus, Regularization of neural networks using dropconnect, International Conference on Machine Learning, 2013.

Z. Wang, B. Fan, and F. Wu, Local intensity order pattern for feature description, International Conference on Computer Vision, 2011.

C. Williams and M. Seeger, Using the Nyström method to speed up kernel machines, Advances in Neural Information Processing Systems, 2001.

S. Winder, G. Hua, and M. Brown, Picking the best DAISY, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206839

C. Wu, SiftGPU: A GPU implementation of scale invariant feature transform (SIFT), 2007.

L. Yaeger, R. Lyon, and B. Webb, Effective training of a neural network character classifier for word recognition, Advances in Neural Image Processing Sytems, 1996.

J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, How transferable are features in deep neural networks?, Advances in Neural Information Processing Systems, 2014.

S. Zagoruyko and N. Komodakis, Learning to compare image patches via convolutional neural networks, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7299064
URL : https://hal.archives-ouvertes.fr/hal-01246261

J. Zbontar and Y. Lecun, Computing the stereo matching cost with a convolutional neural network, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.
DOI : 10.1109/CVPR.2015.7298767