S. Agarwal, A. Awan, and D. Roth, Learning to detect objects in images via a sparse, part-based representation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.11, 2004.
DOI : 10.1109/TPAMI.2004.108

C. Carson, S. Belongie, H. Greenspan, and J. Malik, Blobworld: image segmentation using expectation-maximization and its application to image querying, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.8, pp.1026-1038, 2002.
DOI : 10.1109/TPAMI.2002.1023800

D. Comaniciu and P. Meer, Mean shift: a robust approach toward feature space analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.5, 2002.
DOI : 10.1109/34.1000236

G. Dorkó and C. Schmid, Selection of scale-invariant parts for object class recognition, Proceedings Ninth IEEE International Conference on Computer Vision, pp.634-640, 2003.
DOI : 10.1109/ICCV.2003.1238407

F. Jurie and B. Triggs, Creating efficient codebooks for visual recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.66

URL : https://hal.archives-ouvertes.fr/inria-00548511

T. Kadir, A. Zisserman, and M. Brady, An Affine Invariant Salient Region Detector, ECCV, 2004.
DOI : 10.1007/978-3-540-24670-1_18

M. Kumar, P. Torr, and A. Zisserman, OBJ CUT, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.249

S. Kumar and M. Hebert, Discriminative fields for modeling spatial dependencies in natural images, NIPS, 2004.

S. Lazebnik, C. Schmid, and J. Ponce, Semi-Local Affine Parts for Object Recognition, Procedings of the British Machine Vision Conference 2004, 2004.
DOI : 10.5244/C.18.98

URL : https://hal.archives-ouvertes.fr/inria-00548542

B. Leibe and B. Schiele, Scale-Invariant Object Categorization Using a Scale-Adaptive Mean-Shift Search
DOI : 10.1007/978-3-540-28649-3_18

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.411

T. Lindeberg and J. Garding, Shape-adapted smoothing in estimation of 3D depth cues from affine distortions of local 2D brightness structure, ECCV, pp.389-400, 1994.

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

J. Malik, S. Belongie, J. Shi, and T. Leung, Textons, contours and regions: Cue combination in image segmentation, ICCV, 1999.

D. Martin, C. Fowlkes, D. Tal, and J. Malik, A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, 2001.
DOI : 10.1109/ICCV.2001.937655

K. Mikolajczyk and C. Schmid, Scale & Affine Invariant Interest Point Detectors, International Journal of Computer Vision, vol.60, issue.1, pp.63-86, 2004.
DOI : 10.1023/B:VISI.0000027790.02288.f2

URL : https://hal.archives-ouvertes.fr/inria-00548554

G. Mori and J. Malik, Estimating Human Body Configurations Using Shape Context Matching, Workshop on Models versus Exemplars in Computer Vision, CVPR, 2001.
DOI : 10.1007/3-540-47977-5_44

K. Murphy, A. Torralba, and W. Freeman, Using the forest to see the trees:a graphical model relating features, objects and scenes, NIPS, 2003.

A. Opelt and A. Pinz, Object Localization with Boosting and Weak Supervision for Generic Object Recognition, SCIA, 2005.
DOI : 10.1007/11499145_87

X. Ren and J. Malik, Learning a classification model for segmentation, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238308

C. Schmid, Constructing models for content-based image retrieval, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, 2001.
DOI : 10.1109/CVPR.2001.990922

URL : https://hal.archives-ouvertes.fr/inria-00548274

H. Schneiderman and T. Kanade, A statistical method for 3D object detection applied to faces and cars, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), 2000.
DOI : 10.1109/CVPR.2000.855895

Z. Tu, Z. Chen, A. L. Yuille, and S. Zhu, Image parsing: Unifying segmentation, detection, and recognition, IJCV, 2005.
DOI : 10.1007/11957959_28

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.4157

P. Viola, M. J. Jones, and D. Snow, Detecting pedestrians using patterns of motion and appearance, ICCV, 2003.

M. Weber, W. Einhuser, M. Welling, and P. Perona, Viewpoint-invariant learning and detection of human heads, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), 2000.
DOI : 10.1109/AFGR.2000.840607

S. Yu and J. Shi, Object-specific figure-ground segregation, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., 2003.
DOI : 10.1109/CVPR.2003.1211450

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.12.9902