R. Arandjelovi´carandjelovi´c and A. Zisserman, Three things everyone should know to improve object retrieval, CVPR, 2012.

R. Arandjelovi´carandjelovi´c and A. Zisserman, All about VLAD, CVPR, 2013.

M. Belkin and P. Niyogi, Laplacian Eigenmaps for Dimensionality Reduction and Data Representation, Neural Computation, vol.15, issue.6, 2003.
DOI : 10.1126/science.290.5500.2319

L. Bo and C. Sminchisescu, Efficient match kernels between sets of features for visual recognition, NIPS, 2009.

Y. Boureau, F. Bach, Y. Lecun, and J. Ponce, Learning midlevel features for recognition, CVPR, 2010.

O. Chum and J. Matas, Unsupervised discovery of cooccurrence in sparse high dimensional data, CVPR, 2010.

Y. Fu, M. Liu, and T. S. Huang, Conformal Embedding Analysis with Local Graph Modeling on the Unit Hypersphere, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383410

K. Grauman and T. Darrell, The pyramid match kernel: discriminative classification with sets of image features, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.239

C. Hegde, A. C. Sankaranarayanan, W. Yin, and R. G. Baraniuk, NuMax: A Convex Approach for Learning Near-Isometric Linear Embeddings, IEEE Transactions on Signal Processing, vol.63, issue.22, 2012.
DOI : 10.1109/TSP.2015.2452228

H. Jégou and O. Chum, Negative evidences and cooccurences in image retrieval: The benefit of PCA and whitening, ECCV, 2012. [11] H. Jégou, M. Douze, and C. Schmid. On the burstiness of visual elements CVPR, 2009.

H. Jégou, M. Douze, and C. Schmid, Improving bag-offeatures for large scale image search, 2010.

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, PAMI, 2012.
DOI : 10.1109/TPAMI.2011.235

H. Jégou, C. Schmid, H. Harzallah, and J. Verbeek, Accurate Image Search Using the Contextual Dissimilarity Measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.1, 2010.
DOI : 10.1109/TPAMI.2008.285

P. A. Knight, The Sinkhorn???Knopp Algorithm: Convergence and Applications, SIAM Journal on Matrix Analysis and Applications, vol.30, issue.1, pp.261-275, 2008.
DOI : 10.1137/060659624

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

W. Liu, J. Wang, S. Kumar, and S. Chang, Hashing with graphs, ICML, 2011.

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

K. Mikolajczyk and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, pp.63-86, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2

URL : https://hal.archives-ouvertes.fr/inria-00548554

N. Murray and F. Perronnin, Generalized Max Pooling, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.317

D. Nistér and H. Stewénius, Scalable Recognition with a Vocabulary Tree, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.264

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635

N. L. Roux and F. Bach, Local component analysis, Proc. Intl Conf. on Learning Representations, 2013.
URL : https://hal.archives-ouvertes.fr/inria-00617965

B. Safadi and G. Quénot, Descriptor optimization for multimedia indexing and retrieval, CBMI, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00981672

K. Simonyan, A. Vedaldi, and A. Zisserman, Learning Local Feature Descriptors Using Convex Optimisation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.8, 2014.
DOI : 10.1109/TPAMI.2014.2301163

R. Sinkhorn, A Relationship Between Arbitrary Positive Matrices and Doubly Stochastic Matrices, The Annals of Mathematical Statistics, vol.35, issue.2, pp.876-879, 1964.
DOI : 10.1214/aoms/1177703591

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

G. Tolias, Y. Avrithis, and H. Jégou, To Aggregate or Not to aggregate: Selective Match Kernels for Image Search, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.177

URL : https://hal.archives-ouvertes.fr/hal-00864684

J. Wang, J. Yang, F. L. Yu, T. Huang, and Y. Gong, Locality-constrained Linear Coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540018

S. Winder and M. Brown, Learning Local Image Descriptors, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.382971

R. Arandjelovi´carandjelovi´c and A. Zisserman, Three things everyone should know to improve object retrieval, CVPR, 2012.

H. Jégou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, ECCV, 2008.
DOI : 10.1007/978-3-540-88682-2_24

H. Jégou, M. Douze, and C. Schmid, On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206609

D. Nistér and H. Stewénius, Scalable Recognition with a Vocabulary Tree, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.264

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

K. Simonyan, A. Vedaldi, and A. Zisserman, Learning Local Feature Descriptors Using Convex Optimisation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.8, 2014.
DOI : 10.1109/TPAMI.2014.2301163

S. Winder and M. Brown, Learning Local Image Descriptors, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.382971