R. Aly, C. Hauff, W. Heeren, D. Hiemstra, F. de Jong et al., The lowlands team at TRECVid 2007, Proceedings of the 7th TRECVid Workshop, 2007.

R. Aly, D. Hiemstra, A. P. de Vries, and H. Rode, The lowlands team at TRECVid, Proceedings of the 8th TRECVid Workshop, 2008.

R. Arandjelović and A. Zisserman, Multiple queries for large scale specific object retrieval, Proceedings of the British Machine Vision Conference, 2012.

R. Arandjelović and A. Zisserman, Three things everyone should know to improve object retrieval, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2012.

M. Ayari, J. Delhumeau, M. Douze, H. Jégou, D. Potapov et al., INRIA@TRECVID'2011: Copy Detection & Multimedia Event Detection, TRECVID, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00648016

K. Chatfield and A. Zisserman, VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval, Asian Conference on Computer Vision, 2012.
DOI : 10.1007/978-3-642-37444-9_34

G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV Workshop Statistical Learning in Computer Vision, 2004.

N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented Histograms of Flow and Appearance, European Conference on Computer Vision, 2006.
DOI : 10.1023/A:1008162616689
URL : https://hal.archives-ouvertes.fr/inria-00548587

M. Everingham, J. Sivic, and A. Zisserman, Taking the bite out of automatic naming of characters in TV video, Image and Vision Computing, 2009.

P. Felzenszwalb and D. Huttenlocher, Pictorial Structures for Object Recognition, International Journal of Computer Vision, vol.61, issue.1, 2005.
DOI : 10.1023/B:VISI.0000042934.15159.49
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.12.6365

G. B. Huang, M. Ramesh, T. Berg, and E. Learned-Miller, Labeled faces in the wild: A database for studying face recognition in unconstrained environments, 2007.

H. Jégou, M. Douze, G. Gravier, C. Schmid, and P. Gros, INRIA LEAR-TEXMEX: Video copy detection task, Proc. of the TRECVid, 2010.

J. Matas, O. Chum, M. Urban, and T. Pajdla, Robust wide baseline stereo from maximally stable extremal regions, Proceedings of the British Machine Vision Conference, pp.384-393, 2002.

P. Natarajan, S. Wu, S. Vitaladevuni, X. Zhuang, S. Tsakalidis et al., Multimodal feature fusion for robust event detection in web videos, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6247814

M. Osian and L. Van Gool, Video shot characterization, Proceedings of the 1st TRECVid Workshop, 2003.
DOI : 10.1007/s00138-004-0141-x

P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders et al., TRECVID 2012 - an overview of the goals, tasks, data, evaluation mechanisms and metrics, Proceedings of TRECVID 2012, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00953826

O. M. Parkhi, A. Vedaldi, and A. Zisserman, On-the-fly specific person retrieval, 2012 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012.
DOI : 10.1109/WIAMIS.2012.6226775

M. Perďoch, O. Chum, and J. Matas, Efficient representation of local geometry for large scale object retrieval, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009.

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, Proceedings of the European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

J. A. Shaw and E. A. Fox, Combination of multiple searches, The Third Text REtrieval Conference (TREC-3), pp.243-252, 1994.

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. F. Smeaton, P. Over, and W. Kraaij, Evaluation campaigns and TRECVid, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, MIR '06, pp.321-330, 2006.
DOI : 10.1145/1178677.1178722
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.329.3415

S. Vempati, M. Jain, O. M. Parkhi, C. V. Jawahar, M. Marszalek et al., Oxford-IIIT TRECVID 2009 - Notebook Paper, Proceedings of the 5th TRECVid Workshop, 2009.

H. Wang, A. Kläser, C. Schmid, and C. Liu, Action recognition by dense trajectories, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995407
URL : https://hal.archives-ouvertes.fr/inria-00583818

R. Yan, Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval, 2006.