P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1627-1645, 2010.
DOI : 10.1109/TPAMI.2009.167

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.2745

L. Zhu, Y. Chen, A. Yuille, and W. Freeman, Latent hierarchical structural learning for object detection, In: CVPR, pp.1062-1069, 2010.

L. Bourdev and J. Malik, Poselets: Body part detectors trained using 3D human pose annotations, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459303

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, In: CVPR, 2009.

M. Fischler and R. Elschlager, The Representation and Matching of Pictorial Structures, IEEE Transactions on Computers, vol.22, issue.1, pp.67-92, 1973.
DOI : 10.1109/T-C.1973.223602

P. Felzenszwalb and D. Huttenlocher, Pictorial Structures for Object Recognition, International Journal of Computer Vision, vol.61, issue.1, pp.55-79, 2005.
DOI : 10.1023/B:VISI.0000042934.15159.49

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.12.6365

D. Ramanan, Learning to parse images of articulated bodies, In: NIPS, 2006.

V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Pose search: Retrieving people using their pose, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206495

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.8291

Y. Yang and D. Ramanan, Articulated pose estimation with flexible mixtures-of-parts, In: CVPR, pp.1385-1392, 2011.
DOI : 10.1109/CVPR.2011.5995741

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.661.1333

W. Yang, Y. Wang, and G. Mori, Recognizing human actions from still images with latent poses, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2030-2037, 2010.
DOI : 10.1109/CVPR.2010.5539879

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.3890

D. Cristinacce and T. Cootes, Feature Detection and Tracking with Constrained Local Models, Proceedings of the British Machine Vision Conference 2006, 2006.
DOI : 10.5244/C.20.95

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.6210

S. N. Parizi, J. Oberlin, and P. Felzenszwalb, Reconfigurable models for scene recognition, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248001

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.232.85

P. Ott and M. Everingham, Shared parts for deformable part-based models, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995357

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.225.6520

Y. Wang, D. Tran, and Z. Liao, Learning hierarchical poselets for human parsing, CVPR 2011, pp.1705-1712, 2011.
DOI : 10.1109/CVPR.2011.5995519

S. Branson, S. Belongie, and P. Perona, Strong supervision from weak annotation: Interactive training of deformable part models, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126450

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.73, issue.2, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4

URL : https://hal.archives-ouvertes.fr/inria-00548574

Y. Chen, L. Zhu, and A. Yuille, Active Mask Hierarchies for Object Detection, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_4

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.400.9818

O. Parkhi, A. Vedaldi, C. V. Jawahar, and A. Zisserman, The truth about cats and dogs, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126398

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.371.1670

L. Bourdev, S. Maji, T. Brox, and J. Malik, Detecting People Using Mutually Consistent Poselet Activations, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15567-3_13

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.178.1823

S. Johnson and M. Everingham, Learning effective human pose estimation from inaccurate annotation, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995318

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.638.2045

M. Sun and S. Savarese, Articulated part-based model for joint object detection and pose estimation, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126309

X. Zhu and D. Ramanan, Face detection, pose estimation, and landmark localization in the wild, In: CVPR, 2012.

D. Koller and N. Friedman, Probabilistic Graphical Models: Principles and Techniques, 2009.

V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Progressive search space reduction for human pose estimation, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587468

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.321.2867