M. Andriluka, S. Roth, and B. Schiele, Pictorial structures revisited: People detection and articulated pose estimation, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206754
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.395.158

A. Bobick and J. Davis, The recognition of human movement using temporal templates, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, issue.3, pp.257-267, 2001.
DOI : 10.1109/34.910878

L. Bourdev and J. Malik, Poselets: Body part detectors trained using 3D human pose annotations, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459303

T. Brox, L. Bourdev, S. Maji, and J. Malik, Object segmentation by alignment of poselet activations to image contours, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995659

G. Csurka, C. Bray, C. Dance, and L. Fan, Visual categorization with bags of keypoints, WS-SLCV, ECCV, 2004.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

V. Delaitre, I. Laptev, and J. Sivic, Recognizing human actions in still images: a study of bag-of-features and part-based representations, Proceedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.97
URL : https://hal.archives-ouvertes.fr/hal-01060885

C. Desai, D. Ramanan, and C. Fowlkes, Discriminative models for multi-class object layout, ICCV, 2009.

C. Desai, D. Ramanan, and C. Fowlkes, Discriminative models for static human-object interactions, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Workshops, 2010.
DOI : 10.1109/CVPRW.2010.5543176

M. Everingham, L. Van Gool, C. Williams, J. Winn, and A. Zisserman, The PASCAL Visual Object Classes (VOC) Challenge, IJCV, 2010.

A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth, Describing objects by their attributes, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206772
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.9539

L. Fei-Fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2009.
DOI : 10.1109/TPAMI.2009.167
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.2745

P. Felzenszwalb and D. Huttenlocher, Distance transforms of sampled functions, Technical Report TR2004-1963, Cornell Computing and Information Science, 2004.

V. Ferrari, M. Marin-Jimenez, and A. Zisserman, Pose search: Retrieving people using their pose, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206495
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.8291

Y. Freund and R. Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, Journal of Computer and System Sciences, vol.55, issue.1, pp.119-139, 1997.
DOI : 10.1006/jcss.1997.1504

A. Gupta, A. Kembhavi, and L. Davis, Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.10, pp.1775-1789, 2009.
DOI : 10.1109/TPAMI.2009.83

H. Harzallah, F. Jurie, and C. Schmid, Combining efficient object localization and image classification, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459257
URL : https://hal.archives-ouvertes.fr/inria-00439516

D. Hoiem, A. Efros, and M. Hebert, Putting objects in perspective, CVPR, 2006.

N. Ikizler, R. G. Cinbis, S. Pehlivan, and P. Duygulu, Recognizing actions from still images, 2008 19th International Conference on Pattern Recognition, 2008.
DOI : 10.1109/ICPR.2008.4761663
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.159.265

S. Johnson and M. Everingham, Learning effective human pose estimation from inaccurate annotation, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995318
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.638.2045

C. Lampert, H. Nickisch, and S. Harmeling, Learning to detect unseen object classes by between-class attribute transfer, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206594
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.165.9750

I. Laptev, M. Marszałek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585

L. Li, H. Su, E. Xing, and L. Fei-Fei, Object bank: A high-level image representation for scene classification and semantic feature sparsification, NIPS, 2010.

S. Maji, L. Bourdev, and J. Malik, Action recognition from a distributed representation of pose and appearance, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995631

T. B. Moeslund, A. Hilton, and V. Kruger, A survey of advances in vision-based human motion capture and analysis, Computer Vision and Image Understanding, vol.104, issue.2-3, pp.90-126, 2006.
DOI : 10.1016/j.cviu.2006.08.002

A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora, and S. Belongie, Objects in Context, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408986

B. Sapp, A. Toshev, and B. Taskar, Cascaded Models for Articulated Pose Estimation, ECCV, 2010.
DOI : 10.1007/978-3-642-15552-9_30
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.5851

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. Torralba, Contextual priming for object detection, International Journal of Computer Vision, vol.53, issue.2, pp.169-191, 2003.
DOI : 10.1023/A:1023052124951

A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman, Multiple kernels for object detection, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459183
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.163.5316

Y. Wang, H. Jiang, M. S. Drew, Z. N. Li, and G. Mori, Unsupervised Discovery of Action Classes, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.1654-1661, 2006.
DOI : 10.1109/CVPR.2006.321

W. Yang, Y. Wang, and G. Mori, Recognizing human actions from still images with latent poses, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539879
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.3890

B. Yao and L. Fei-Fei, Grouplet: A structured image representation for recognizing human and object interactions, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540234

B. Yao and L. Fei-Fei, Modeling mutual context of object and human pose in human-object interaction activities, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540235

J. Zhang, M. Marszałek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.73, issue.2, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4
URL : https://hal.archives-ouvertes.fr/inria-00548574