D. Blei and M. Jordan, Modeling annotated data, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , SIGIR '03, 2003.
DOI : 10.1145/860435.860460

E. Borenstein, E. Sharon, and S. Ullman, Combining Top-Down and Bottom-Up Segmentation, 2004 Conference on Computer Vision and Pattern Recognition Workshop, 2004.
DOI : 10.1109/CVPR.2004.314

G. Csurka, C. Dance, L. Fan, J. Williamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV workshop on Statistical Learning in ComputerVision, pp.59-74, 2004.

L. Fei-fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

R. Fergus, L. Fei-fei, P. Perona, and A. Zisserman, Learning object categories from Google's image search, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.5228-5235, 2005.
DOI : 10.1109/ICCV.2005.142

M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang et al., Query by image and video content: the qbic system, Computer, issue.9, pp.2823-2855, 1995.

M. Fritz, B. Leibe, B. Caputo, and B. Schiele, Integrating representative and discriminative models for object category detection, ICCV, 2005.

T. Griffiths and M. Steyvers, Finding scientific topics, PNAS, 2004.
DOI : 10.1073/pnas.0307752101

T. Hofmann, Unsupervised learning by probabilistic latent semantic analysis, Machine Learning, pp.177-196, 2001.

F. Jurie and B. Triggs, Creating efficient codebooks for visual recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.66

URL : https://hal.archives-ouvertes.fr/inria-00548511

S. Kumar and M. Hebert, Discriminative Random Fields, International Journal of Computer Vision, vol.21, issue.1, pp.179-201, 2006.
DOI : 10.1007/s11263-006-7007-9

S. Lazebnik, C. Schmid, and J. Ponce, Semi-Local Affine Parts for Object Recognition, Procedings of the British Machine Vision Conference 2004, pp.779-788, 2004.
DOI : 10.5244/C.18.98

URL : https://hal.archives-ouvertes.fr/inria-00548542

S. Lazebnik, C. Schmid, and J. Ponce, A maximum entropy framework for part-based texture and object recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.10

URL : https://hal.archives-ouvertes.fr/inria-00548510

B. Leibe and B. Schiele, Interleaved Object Categorization and Segmentation, Procedings of the British Machine Vision Conference 2003, 2003.
DOI : 10.5244/C.17.78

D. J. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

F. Perronnin, C. Dance, G. Csurka, and M. Bressan, Adapted Vocabularies for Generic Visual Categorization, ECCV, 2006.
DOI : 10.1007/11744085_36

A. Quattoni, M. Collins, and T. Darrell, Conditional random fields for object recognition, NIPS, pp.1097-1104, 2004.

P. Quelhas, F. Monay, J. Odobez, D. Gatica-perez, T. Tuytelaars et al., Modeling scenes with local descriptors and latent aspects, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.152

J. Sivic, B. Russell, A. Efros, A. Zisserman, and B. Freeman, Discovering objects and their location in images, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.77

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

E. Sudderth, A. Torralba, W. Freeman, and A. Willsky, Describing visual scenes using transformed dirichlet processes, NIPS, 2006.

J. Winn, A. Criminisi, and T. Minka, Object categorization by learned universal visual dictionary, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.171

J. Winn and J. Shotton, The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 1 (CVPR'06), pp.37-44, 2006.
DOI : 10.1109/CVPR.2006.305