R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua et al., SLIC Superpixels Compared to State-of-the-Art Superpixel Methods, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.11, pp.2274-2282, 2012.
DOI : 10.1109/TPAMI.2012.120

Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid, Good Practice in Large-Scale Learning for Image Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.3, 2013.
DOI : 10.1109/TPAMI.2013.146

URL : https://hal.archives-ouvertes.fr/hal-00690014

R. Arandjelovic and A. Zisserman, Three things everyone should know to improve object retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248018

Y. Avrithis and K. Rapantzikos, The medial feature detector: Stable regions from image boundaries, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126436

A. Babenko, A. Slesarev, A. Chigorin, and V. Lempitsky, Neural Codes for Image Retrieval, ECCV, 2014.
DOI : 10.1007/978-3-319-10590-1_38

H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, Speeded-Up Robust Features (SURF), Computer Vision and Image Understanding, vol.110, issue.3, pp.346-359, 2008.
DOI : 10.1016/j.cviu.2007.09.014

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.205.738

A. Bhatia and E. Wolf, On the circle polynomials of Zernike and related orthogonal sets, Mathematical Proceedings of the Cambridge Philosophical Society, pp.40-48, 1954.
DOI : 10.1017/S0305004100029066

S. Branson, G. Van Horn, S. Belongie, and P. Perona, Bird species categorization using pose normalized deep convolutional nets, arXiv, Tech. Rep., 2014.

M. Calonder, V. Lepetit, C. Strecha, and P. Fua, BRIEF: Binary Robust Independent Elementary Features, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_56

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.175.2122

V. Chandrasekhar, G. Takacs, D. Chen, S. Tsai, R. Grzeszczuk et al., CHoG: compressed histogram of gradients, CVPR, 2009.

S. Clinchant, G. Csurka, F. Perronnin, and J. Renders, XRCE's participation to ImagEval, ImagEval workshop at CIVR, 2007.

J. Delhumeau, P. Gosselin, H. Jégou, and P. Pérez, Revisiting the VLAD image representation, Proceedings of the 21st ACM international conference on Multimedia, MM '13, 2013.
DOI : 10.1145/2502081.2502171

URL : https://hal.archives-ouvertes.fr/hal-00840653

P. Dollár and C. L. Zitnick, Structured Forests for Fast Edge Detection, 2013 IEEE International Conference on Computer Vision, pp.1841-1848, 2013.
DOI : 10.1109/ICCV.2013.231

J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang et al., DeCAF: A deep convolutional activation feature for generic visual recognition, ICML, 2014.

L. Fei-Fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

P. F. Felzenszwalb and D. P. Huttenlocher, Efficient Graph-Based Image Segmentation, International Journal of Computer Vision, vol.59, issue.2, pp.167-181, 2004.
DOI : 10.1023/B:VISI.0000022288.19776.77

M. A. Fischler and R. C. Bolles, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, vol.24, issue.6, pp.381-395, 1981.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.81

P. Gosselin, N. Murray, H. Jégou, and F. Perronnin, Revisiting the Fisher vector for fine-grained classification, Pattern Recognition Letters, vol.49, 2014.
DOI : 10.1016/j.patrec.2014.06.011

URL : https://hal.archives-ouvertes.fr/hal-01056223

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Proceedings of the Alvey Vision Conference 1988, 1988.
DOI : 10.5244/C.2.23

M. Jain, R. Benmokhtar, P. Gros, and H. Jégou, Hamming embedding similarity-based image classification, Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, ICMR '12, 2012.
DOI : 10.1145/2324796.2324820

URL : https://hal.archives-ouvertes.fr/hal-00688169

M. Jain, H. Jégou, and P. Bouthemy, Better motion for better action recognition, CVPR, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00813014

H. Jégou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, ECCV, 2008.
DOI : 10.1007/978-3-540-88682-2_24

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540039

A. Khotanzad and Y. H. Hong, Invariant image recognition by Zernike moments, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.12, issue.5, pp.489-497, 1990.
DOI : 10.1109/34.55109

P. Koniusz, F. Yan, and K. Mikolajczyk, Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection, Computer Vision and Image Understanding, vol.117, issue.5, pp.479-492, 2013.
DOI : 10.1016/j.cviu.2012.10.010

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, NIPS, 2012.

T. Lindeberg, Feature detection with automatic scale selection, International Journal of Computer Vision, vol.30, issue.2, pp.77-116, 1998.

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

S. Maji, J. Kannala, E. Rahtu, M. Blaschko, and A. Vedaldi, Fine-grained visual classification of aircraft, arXiv, Tech. Rep., 2013.
URL : https://hal.archives-ouvertes.fr/hal-00842101

J. Matas, O. Chum, M. Urban, and T. Pajdla, Robust wide baseline stereo from maximally stable extremal regions, BMVC, pp.384-393, 2002.

S. McCann and D. G. Lowe, Spatially Local Coding for Object Recognition, Computer Vision - ACCV 2012, pp.204-217, 2013.
DOI : 10.1007/978-3-642-37331-2_16

K. Mikolajczyk, A. Zisserman, and C. Schmid, Shape recognition with edge-based features, Proceedings of the British Machine Vision Conference 2003, 2003.
DOI : 10.5244/C.17.79

URL : https://hal.archives-ouvertes.fr/inria-00548226

K. Mikolajczyk and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, pp.63-86, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2

URL : https://hal.archives-ouvertes.fr/inria-00548554

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, pp.43-72, 2005.
DOI : 10.1007/s11263-005-3848-x

URL : https://hal.archives-ouvertes.fr/inria-00548528

N. Murray and F. Perronnin, Generalized Max Pooling, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.317

URL : http://arxiv.org/abs/1406.0312

M. Nilsback and A. Zisserman, Automated Flower Classification over a Large Number of Classes, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008.
DOI : 10.1109/ICVGIP.2008.47

E. Nowak, F. Jurie, and B. Triggs, Sampling strategies for bag-of-features image classification, ECCV, 2006.

Š. Obdržálek and J. Matas, Object recognition using local affine frames on maximally stable extremal regions, Toward Category-Level Object Recognition, pp.83-104, 2006.

O. M. Parkhi, A. Vedaldi, A. Zisserman, and C. V. Jawahar, Cats and dogs, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248092

M. Perdoch, O. Chum, and J. Matas, Efficient representation of local geometry for large scale object retrieval, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206529

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

A. S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson, CNN Features Off-the-Shelf: An Astounding Baseline for Recognition, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2014.
DOI : 10.1109/CVPRW.2014.131

J. Revaud, G. Lavoué, and A. Baskurt, Improving Zernike Moments Comparison for Optimal Similarity and Rotation Angle Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.4, pp.627-636, 2009.
DOI : 10.1109/TPAMI.2008.115

URL : https://hal.archives-ouvertes.fr/hal-01437606

K. Simonyan, A. Vedaldi, and A. Zisserman, Learning Local Feature Descriptors Using Convex Optimisation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.8, 2014.
DOI : 10.1109/TPAMI.2014.2301163

C. Teh and R. T. Chin, On image analysis by the methods of moments, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.10, issue.4, pp.496-513, 1988.
DOI : 10.1109/34.3913

G. Tolias and H. Jégou, Visual query expansion with or without geometry: Refining local descriptors by feature aggregation, Pattern Recognition, vol.47, issue.10, 2014.
DOI : 10.1016/j.patcog.2014.04.007

URL : https://hal.archives-ouvertes.fr/hal-00971267

T. Tuytelaars, Dense interest points, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539911

J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders, Selective Search for Object Recognition, International Journal of Computer Vision, vol.104, issue.2, pp.154-171, 2013.
DOI : 10.1007/s11263-013-0620-5

C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, The Caltech-UCSD Birds-200-2011 Dataset, California Institute of Technology, 2011.

H. Wang, A. Kläser, C. Schmid, and C.-L. Liu, Action recognition by dense trajectories, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995407

URL : https://hal.archives-ouvertes.fr/inria-00583818

Z. Wang, J. Feng, and S. Yan, Collaborative Linear Coding for Robust Image Classification, International Journal of Computer Vision, vol.21, issue.2, pp.1-12, 2014.
DOI : 10.1007/s11263-014-0739-z

S. Winder and M. Brown, Learning Local Image Descriptors, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.382971

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.686.3515

S. Winder, G. Hua, and M. Brown, Picking the best DAISY, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206839

F. Zernike, Beugungstheorie des Schneidenverfahrens und seiner verbesserten Form, der Phasenkontrastmethode, Physica, vol.1, issue.7-12, pp.689-704, 1934.
DOI : 10.1016/S0031-8914(34)80259-5

W. Zhao, H. Jégou, and G. Gravier, Oriented pooling for dense and non-dense rotation-invariant features, Proceedings of the British Machine Vision Conference 2013, 2013.
DOI : 10.5244/C.27.99

URL : https://hal.archives-ouvertes.fr/hal-00841590