J. Ah-Pine, C. Cifarelli, S. Clinchant, G. Csurka, and J. Renders, XRCE's participation to ImageCLEF, Working Notes of the CLEF Workshop. CLEF Campaign, 2008.

J. Ah-Pine, S. Clinchant, G. Csurka, F. Perronnin, and J. Renders, Leveraging Image, Text and Cross-media Similarities for Diversity-focused Multimedia Retrieval, ImageCLEF - Experimental Evaluation in Visual Information Retrieval, 2010.
DOI : 10.1007/978-3-642-15181-1_17
URL : https://hal.archives-ouvertes.fr/hal-01504565

N. Ailon, A Simple Linear Ranking Algorithm Using Query Dependent Intercept Variables, 2009.
DOI : 10.1007/978-3-642-00958-7_67

T. Arni, P. Clough, M. Sanderson, and M. Grubinger, Overview of the ImageCLEFphoto 2008 photographic retrieval task, Working Notes of the CLEF Workshop. CLEF Campaign, 2008.

K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. Blei et al., Matching words and pictures, JMLR, vol.3, pp.1107-1135, 2003.

R. Bekkerman and J. Jeon, Multi-modal Clustering for Multimedia Collections, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383223

T. Berg, A. Berg, J. Edwards, M. Maire, R. White et al., Names and faces in the news, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, 2004.
DOI : 10.1109/CVPR.2004.1315253

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds et al., Learning to rank using gradient descent, Proceedings of the 22nd international conference on Machine learning, ICML '05, pp.89-96, 2005.
DOI : 10.1145/1102351.1102363

G. Carneiro, A. Chan, P. Moreno, and N. Vasconcelos, Supervised learning of semantic classes for image annotation and retrieval, PAMI, 2007.

Y. Chang and H. Chen, Approaches of Using a Word-Image Ontology and an Annotated Image Corpus as Intermedia for Cross-Language Image Retrieval, Working Notes of the CLEF Workshop, 2006.
DOI : 10.1007/978-3-540-74999-8_76

G. Chechik, V. Sharma, U. Shalit, and S. Bengio, Large Scale Online Learning of Image Similarity through Ranking, JMLR, vol.11, pp.1109-1135, 2010.
DOI : 10.1007/978-3-642-02172-5_2

O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman, Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408891

S. Clinchant, J. Renders, and G. Csurka, XRCE's participation to ImageCLEFphoto, Working Notes of the CLEF Workshop, 2007.

C. Cusano, G. Ciocca, and R. Schettini, Image annotation using SVM, Proceedings Internet imaging (SPIE), 2004.
DOI : 10.1117/12.526746
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.3235

A. Depeursinge and H. Müller, Fusion Techniques for Combining Textual and Visual Information Retrieval, ImageCLEF - Experimental Evaluation in Visual Information Retrieval, 2010.
DOI : 10.1007/978-3-642-15181-1_6

P. Duygulu, K. Barnard, N. de Freitas, and D. Forsyth, Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary, ECCV, 2002.
DOI : 10.1007/3-540-47979-1_7

M. Everingham, L. Van Gool, C. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, 2007.
DOI : 10.1007/s11263-009-0275-4

S. Feng, R. Manmatha, and V. Lavrenko, Multiple Bernoulli relevance models for image and video annotation, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, 2004.
DOI : 10.1109/CVPR.2004.1315274

A. Frome, J. Malik, and Y. Singer, Image retrieval and classification using local distance functions, NIPS, pp.417-424, 2007.

D. Grangier and S. Bengio, A discriminative kernel-based model to rank images from text queries, 2008.
DOI : 10.1109/tpami.2007.70791

M. Grubinger, P. Clough, H. Müller, and T. Deselaers, The IAPR benchmark: A new evaluation resource for visual information systems, Int. Conf. on Language Resources and Evaluation, 2006.

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459266
URL : https://hal.archives-ouvertes.fr/inria-00439276

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid, Face Recognition from Caption-Based Supervision, International Journal of Computer Vision, vol.57, issue.2, 2011.
DOI : 10.1007/s11263-011-0447-x
URL : https://hal.archives-ouvertes.fr/inria-00522185

T. Hertz, A. Bar-Hillel, and D. Weinshall, Learning distance functions for image retrieval, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, 2004.
DOI : 10.1109/CVPR.2004.1315215
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.106.5883

H. Jégou, M. Douze, and C. Schmid, Packing bag-of-features, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459419

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic image annotation and retrieval using cross-media relevance models, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '03, 2003.
DOI : 10.1145/860435.860459
URL : http://ciir.cs.umass.edu/pubfiles/mm-41.pdf

T. Joachims, A support vector method for multivariate performance measures, Proceedings of the 22nd international conference on Machine learning, ICML '05, 2005.
DOI : 10.1145/1102351.1102399
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.140.1854

J. Kludas, S. Marchand-Maillet, and E. Bruno, Information Fusion in Multimedia Information Retrieval, Workshop on Adaptive Multimedia Retrieval, 2007.
DOI : 10.1007/978-3-540-79860-6_12

J. Krapac, M. Allan, J. Verbeek, and F. Jurie, Improving web image search results using query-relative classifiers, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540092
URL : https://hal.archives-ouvertes.fr/inria-00548636

I. Laptev, M. Marszałek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659

V. Lavrenko, R. Manmatha, and J. Jeon, A model for learning the semantics of pictures, NIPS, 2003.

D. Lewis, Applying support vector machines to the TREC-2001 batch filtering and routing tasks, Proceedings of (TREC), 2001.

H. Li, Learning to Rank for Information Retrieval and Natural Language Processing.

Image annotation by large-scale content based image retrieval, ACM Multimedia, 2006.

J. Liu, M. Li, Q. Liu, H. Lu, and S. Ma, Image annotation via graph learning, Pattern Recognition, vol.42, issue.2, 2009.
DOI : 10.1016/j.patcog.2008.04.012

N. Maillot, J. Chevallet, V. Valea, and J. Lim, IPAL inter-media pseudo-relevance feedback approach to ImageCLEF, Working Notes of the CLEF Workshop, 2006.
DOI : 10.1007/978-3-540-74999-8_92
URL : https://hal.archives-ouvertes.fr/hal-00954108

A. Makadia, V. Pavlovic, and S. Kumar, A New Baseline for Image Annotation, ECCV, 2008.
DOI : 10.1007/978-3-540-88690-7_24
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.142.3054

T. Mensink, G. Csurka, F. Perronnin, J. Sánchez, and J. Verbeek, LEAR and XRCE's participation to visual concept detection task, Working Notes of the CLEF Workshop, 2010.

T. Mensink, G. Csurka, and J. Verbeek, Trans Media Relevance Feedback for Image Autoannotation, Proceedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.20
URL : https://hal.archives-ouvertes.fr/inria-00548632

F. Monay and D. Gatica-Perez, PLSA-based image auto-annotation, Proceedings of the 12th annual ACM international conference on Multimedia, MULTIMEDIA '04, 2004.
DOI : 10.1145/1027527.1027608

H. Müller, P. Clough, T. Deselaers, and B. Caputo, ImageCLEF - Experimental Evaluation in Visual Information Retrieval, 2010.

S. Navarro, M. García, F. Llopis, R. Díaz, M. Muñoz et al., Text-mess in the ImageCLEFphoto08 task, Working Notes of the CLEF Workshop, 2008.

A. Oliva and A. Torralba, Modeling the shape of the scene: a holistic representation of the spatial envelope, 2001.

J. Pan, H. Yang, C. Faloutsos, and P. Duygulu, Automatic multimedia cross-modal correlation discovery, ACM SIGKDD, 2004.
DOI : 10.1145/1014052.1014135
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.3860

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.71.7388

F. Perronnin, Y. Liu, and J. Renders, A family of contextual measures of similarity between distributions with application to image retrieval, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206505
URL : https://hal.archives-ouvertes.fr/hal-01437742

J. Ponte and W. Croft, A language modelling approach to information retrieval, SIGIR, 1998.
DOI : 10.1145/290941.291008
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.54.6410

T. Qin, T. Liu, J. Xu, and H. Li, LETOR: A benchmark collection for research on learning to rank for information retrieval, Information Retrieval, vol.44, issue.2, pp.346-374, 2010.
DOI : 10.1007/s10791-009-9123-y

G. Salton and C. Buckley, Improving retrieval performance by relevance feedback, Journal of the American Society for Information Science, vol.41, issue.4, 1990.
DOI : 10.1002/(SICI)1097-4571(199006)41:4<288::AID-ASI8>3.0.CO;2-H
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.92.3553

F. Schroff, A. Criminisi, and A. Zisserman, Harvesting image databases from the web, ICCV, 2007.
DOI : 10.1109/iccv.2007.4409099
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.117.4040

C. Snoek, M. Worring, and A. Smeulders, Early versus late fusion in semantic video analysis, Proceedings of the 13th annual ACM international conference on Multimedia, MULTIMEDIA '05, 2005.
DOI : 10.1145/1101149.1101236

J. Verbeek, M. Guillaumin, T. Mensink, and C. Schmid, Image annotation with TagProp on the MIRFLICKR set, Proceedings of the international conference on Multimedia information retrieval, MIR '10, 2010.
DOI : 10.1145/1743384.1743476
URL : https://hal.archives-ouvertes.fr/inria-00548628

X. Wang, L. Zhang, F. Jing, and W. Ma, AnnoSearch: Image auto-annotation by search, CVPR, 2006.

J. Weston, S. Bengio, and N. Usunier, Large scale image annotation: learning to rank with joint word-image embeddings, ECML, 2010.
DOI : 10.1007/s10994-010-5198-3

O. Yakhnenko and V. Honavar, Annotating images and image objects using a hierarchical dirichlet process model, Proceedings of the 9th International Workshop on Multimedia Data Mining held in conjunction with the ACM SIGKDD 2008, MDM '08, 2008.
DOI : 10.1145/1509212.1509213

Y. Yue, T. Finley, F. Radlinski, and T. Joachims, A support vector method for optimizing average precision, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, 2007.
DOI : 10.1145/1277741.1277790
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.535.8300

H. Zhang, A. Berg, M. Maire, and J. Malik, SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.301

Z. Zheng, H. Zha, T. Zhang, O. Chapelle, K. Chen et al., A general boosting method and its application to learning ranking functions for web search, NIPS, 2008.
