B. Bai, J. Weston, D. Grangier, R. Collobert, Y. Qi et al., Learning to rank with (a lot of) word features, Information Retrieval, vol.22, issue.1, 2010.
DOI : 10.1007/s10791-009-9117-9

S. Bengio, J. Weston, and D. Grangier, Label embedding trees for large multi-class tasks, NIPS, 2011.

L. Bottou, Large-scale machine learning with stochastic gradient descent, COMPSTAT, 2010.

S. Boyd and L. Vandenberghe, Convex Optimization, 2004.

J. Chai, H. Liu, B. Chen, and Z. Bao, Large margin nearest local mean classifier, Signal Processing, vol.90, issue.1, 2010.
DOI : 10.1016/j.sigpro.2009.06.015

G. Chechik, V. Sharma, U. Shalit, and S. Bengio, Large Scale Online Learning of Image Similarity through Ranking, Journal of Machine Learning Research, vol.11, pp.1109-1135, 2010.
DOI : 10.1007/978-3-642-02172-5_2

S. Clinchant, G. Csurka, F. Perronnin, and J. Renders, XRCE's participation to ImagEval, ImagEval Workshop at CIVR, 2007.

G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV Int. Workshop on Stat. Learning in Computer Vision, 2004.

J. Deng, A. Berg, K. Li, and L. Fei-Fei, What Does Classifying More Than 10,000 Image Categories Tell Us?, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_6
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.173.3680

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, CVPR, 2009.

L. Fei-Fei, R. Fergus, and P. Perona, One-shot learning of object categories, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.4, pp.594-611, 2006.
DOI : 10.1109/TPAMI.2006.79

T. Gao and D. Koller, Discriminative learning of relaxed hierarchy for large-scale visual recognition, ICCV, 2011.

J. Gauvain and C. Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, IEEE Transactions on Speech and Audio Processing, vol.2, issue.2, 1994.
DOI : 10.1109/89.279278

A. Globerson and S. Roweis, Metric learning by collapsing classes, NIPS, 2006.

J. Goldberger, S. Roweis, G. Hinton, and R. Salakhutdinov, Neighbourhood component analysis, NIPS, 2005.

A. Gordo, J. Rodríguez, F. Perronnin, and E. Valveny, Leveraging category-level labels for instance-level image retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248035

R. Gray and D. Neuhoff, Quantization, IEEE Transactions on Information Theory, vol.44, issue.6, pp.2325-2383, 1998.
DOI : 10.1109/18.720541

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459266
URL : https://hal.archives-ouvertes.fr/inria-00439276

M. Guillaumin, J. Verbeek, and C. Schmid, Is that you? Metric learning approaches for face identification, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459197
URL : https://hal.archives-ouvertes.fr/inria-00439290

H. Jégou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, ECCV, 2008.
DOI : 10.1007/978-3-540-88682-2_24

H. Jégou, M. Douze, and C. Schmid, Product Quantization for Nearest Neighbor Search, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, 2011.
DOI : 10.1109/TPAMI.2010.57

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, 2012.
DOI : 10.1109/TPAMI.2011.235

M. Köstinger, M. Hirzer, P. Wohlhart, P. Roth, and H. Bischof, Large scale metric learning from equivalence constraints, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247939

C. Lampert, H. Nickisch, and S. Harmeling, Learning to detect unseen object classes by between-class attribute transfer, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206594

H. Larochelle, D. Erhan, and Y. Bengio, Zero-data learning of new tasks, AAAI Conference on Artificial Intelligence, 2008.

Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen et al., Building high-level features using large scale unsupervised learning, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012.
DOI : 10.1109/ICASSP.2013.6639343

Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour et al., Large-scale image classification: Fast feature extraction and SVM training, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995477
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.225.3736

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

A. Lucchi and J. Weston, Joint Image and Word Sense Discrimination for Image Retrieval, ECCV, 2012.
DOI : 10.1007/978-3-642-33718-5_10

T. Mensink, J. Verbeek, F. Perronnin, and G. Csurka, Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_35
URL : https://hal.archives-ouvertes.fr/hal-00722313

T. Mensink, J. Verbeek, F. Perronnin, and G. Csurka, Distance-Based Image Classification: Generalizing to New Classes at Near-Zero Cost, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.11, 2013.
DOI : 10.1109/TPAMI.2013.83
URL : https://hal.archives-ouvertes.fr/hal-00817211

D. Nistér and H. Stewénius, Scalable Recognition with a Vocabulary Tree, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.264

E. Nowak and F. Jurie, Learning Visual Similarity Measures for Comparing Never Seen Objects, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.382969
URL : https://hal.archives-ouvertes.fr/hal-00203958

S. Parameswaran and K. Q. Weinberger, Large margin multi-task metric learning, NIPS, 2010.

F. Perronnin, Z. Akata, Z. Harchaoui, and C. Schmid, Towards good practice in large-scale learning for image classification, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248090
URL : https://hal.archives-ouvertes.fr/hal-00690014

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630

M. Rohrbach, M. Stark, and B. Schiele, Evaluating knowledge transfer and zero-shot learning in a large-scale setting, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995627

K. Saenko, B. Kulis, M. Fritz, and T. Darrell, Adapting Visual Category Models to New Domains, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_16

J. Sánchez and F. Perronnin, High-dimensional signature compression for large-scale image classification, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995504

T. Tommasi and B. Caputo, The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, Proceedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.80

C. Veenman and D. Tax, LESS: a model-based classifier for sparse subspaces, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.9, pp.1496-1500, 2005.
DOI : 10.1109/TPAMI.2005.182

A. R. Webb, Statistical pattern recognition, 2002.
DOI : 10.1002/9781119952954

K. Weinberger, J. Blitzer, and L. Saul, Distance metric learning for large margin nearest neighbor classification, NIPS, 2006.

K. Weinberger and L. Saul, Distance metric learning for large margin nearest neighbor classification, Journal of Machine Learning Research, vol.10, pp.207-244, 2009.

K. Q. Weinberger and O. Chapelle, Large margin taxonomy embedding for document categorization, NIPS, 2009.

J. Weston, S. Bengio, and N. Usunier, WSABIE: Scaling up to large vocabulary image annotation, IJCAI, 2011.

J. Zhang, M. Marszałek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.36, issue.1, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4
URL : https://hal.archives-ouvertes.fr/inria-00548574

X. Zhou, X. Zhang, Z. Yan, S. Chang, M. Hasegawa-Johnson et al., SIFT-Bag kernel for video event analysis, Proceeding of the 16th ACM international conference on Multimedia, MM '08, 2008.
DOI : 10.1145/1459359.1459391