B. Bai, J. Weston, D. Grangier, R. Collobert, O. Chapelle et al., Supervised semantic indexing, CIKM, 2009.
DOI : 10.1007/978-3-642-00958-7_81

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.162.1162

P. L. Bartlett, M. I. Jordan, and J. D. Mcauliffe, Convexity, Classification, and Risk Bounds, NIPS, 2003.
DOI : 10.1198/016214505000000907

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.3497

S. Bengio, J. Weston, and D. Grangier, Label embedding trees for large multi-class tasks, NIPS, 2010.

A. Bordes, L. Bottou, P. Gallinari, and J. Weston, Solving multiclass support vector machines with LaRank, Proceedings of the 24th international conference on Machine learning, ICML '07, 2007.
DOI : 10.1145/1273496.1273508

URL : https://hal.archives-ouvertes.fr/hal-00750277

L. Bottou and O. Bousquet, The tradeoffs of large scale learning, NIPS, 2007.

K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, 2005.
DOI : 10.5244/C.25.76

K. Crammer and Y. Singer, On the algorithmic implementation of multiclass kernel-based vector machines, JMLR, vol.1, issue.3, 2001.

J. Deng, A. Berg, K. Li, and L. Fei-fei, What Does Classifying More Than 10,000 Image Categories Tell Us?, ECCV, 2007.
DOI : 10.1007/978-3-642-15555-0_6

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, CVPR, 2005.

R. Fan, K. Chang, C. Hsieh, X. Wang, and C. Lin, LIBLINEAR: A library for large linear classification, JMLR, issue.4, 2008.

T. Gao and D. Koller, Discriminative learning of relaxed hierarchy for large-scale visual recognition, ICCV, 2011.

H. Jégou, M. Douze, and C. Schmid, Product Quantization for Nearest Neighbor Search, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, 2011.
DOI : 10.1109/TPAMI.2010.57

T. Joachims, Optimizing search engines using clickthrough data, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '02, 2002.
DOI : 10.1145/775047.775067

T. Joachims, Training linear SVMs in linear time, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '06, 2006.
DOI : 10.1145/1150402.1150429

Y. Lecun, L. Bottou, G. Orr, and K. Muller, Efficient backprop, Neural Networks: Tricks of the trade, 1998.

Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour et al., Large-scale image classification: Fast feature extraction and SVM training, CVPR 2011, 2007.
DOI : 10.1109/CVPR.2011.5995477

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.225.3736

D. Lowe-]-s, A. Maji, and . Berg, Distinctive image features from scale-invariant keypoints Max-margin additive classifiers for detection, ICCV, 2004.

M. Marszalek and C. Schmid, Constructing Category Hierarchies for Visual Recognition, ECCV, 2008.
DOI : 10.1007/978-3-540-88693-8_35

URL : https://hal.archives-ouvertes.fr/inria-00548656

S. Nowozin and C. Lampert, Structured learning and prediction in computer vision. Foundations and Trends in Computer Graphics and Vision, 2011.

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and Y. Liu, Large-scale image categorization with explicit data embedding, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004.
DOI : 10.1109/CVPR.2010.5539914

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

R. Rifkin and A. Klautau, In defense of one-vs-all classification, JMLR, issue.2, 2004.

M. Rohrbach, M. Stark, and B. Schiele, Evaluating knowledge transfer and zero-shot learning in a large-scale setting, CVPR 2011, 2004.
DOI : 10.1109/CVPR.2011.5995627

J. Sánchez and F. Perronnin, High-dimensional signature compression for large-scale image classification, CVPR 2011, 2007.
DOI : 10.1109/CVPR.2011.5995504

S. Shalev-shwartz, Y. Singer, and N. Srebro, Pegasos, Proceedings of the 24th international conference on Machine learning, ICML '07, 2007.
DOI : 10.1145/1273496.1273598

A. Tewari and P. L. Bartlett, On the Consistency of Multiclass Classification Methods, JMLR, issue.3, pp.1007-1025, 2007.
DOI : 10.1007/11503415_10

A. Torralba, R. Fergus, and W. Freeman, 80 million tiny images: a large dataset for non-parametric object and scene recognition, IEEE PAMI, issue.2, 2008.

L. Torresani, M. Szummer, and A. Fitzgibbon, Efficient Object Category Recognition Using Classemes, ECCV, 2010.
DOI : 10.1007/978-3-642-15549-9_56

I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun, Large margin methods for structured and interdependent output variables, JMLR, issue.3, 2005.

N. Usunier, D. Buffoni, and P. Gallinari, Ranking with ordered weighted pairwise classification, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, 2009.
DOI : 10.1145/1553374.1553509

URL : https://hal.archives-ouvertes.fr/hal-01297974

A. Vedaldi and A. Zisserman, Efficient additive kernels via explicit feature maps, CVPR, 2010.
DOI : 10.1109/cvpr.2010.5539949

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.7024

J. Wang, J. Yang, K. Yu, F. Lv, T. Huang et al., Localityconstrained linear coding for image classification, CVPR, 2010.

J. Weston, S. Bengio, and N. Usunier, Large scale image annotation: learning??to??rank with??joint word-image embeddings, Machine Learning, vol.5, issue.1, 2007.
DOI : 10.1007/s10994-010-5198-3

J. Weston and C. Watkins, Multi-class support vector machines, 1998.

J. Xu, T. Liu, M. Lu, H. Li, and W. Ma, Directly optimizing evaluation measures in learning to rank, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, 2008.
DOI : 10.1145/1390334.1390355

Y. Yue, T. Finley, F. Radlinski, and T. Joachims, A support vector method for optimizing average precision, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, 2007.
DOI : 10.1145/1277741.1277790

Z. Zhou, K. Yu, T. Zhang, and T. Huang, Image Classification Using Super-Vector Coding of Local Image Descriptors, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_11