M. Andriluka, L. Pishchulin, P. Gehler, and B. Schiele, 2D Human Pose Estimation: New Benchmark and State of the Art Analysis, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.471

L. Cao, Z. Liu, and T. Huang, Cross-dataset action detection, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539875

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.459.6620

A. Gaidon, Z. Harchaoui, and C. Schmid, Temporal Localization of Actions with Actoms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.11, 2013.
DOI : 10.1109/TPAMI.2013.65

URL : https://hal.archives-ouvertes.fr/hal-00804627

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.81

URL : http://arxiv.org/abs/1311.2524

G. Gkioxari and J. Malik, Finding action tubes, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298676

S. Hare, A. Saffari, and P. H. Torr, Struck: Structured output tracking with kernels, ICCV, 2011.
DOI : 10.1109/tpami.2015.2509974

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.294.5858

M. Jain, J. van Gemert, H. Jégou, P. Bouthemy, and C. Snoek, Action Localization with Tubelets from Motion, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.100

URL : https://hal.archives-ouvertes.fr/hal-00996844

H. Jhuang, J. Gall, S. Zuffi, C. Schmid, and M. J. Black, Towards Understanding Action Recognition, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.396

URL : https://hal.archives-ouvertes.fr/hal-00906902

S. Johnson and M. Everingham, Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation, Proceedings of the British Machine Vision Conference 2010, 2010.
DOI : 10.5244/C.24.12

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.175.2192

A. Kar, S. Tulsiani, J. Carreira, and J. Malik, Amodal Completion and Size Constancy in Natural Scenes, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.23

URL : http://arxiv.org/abs/1509.08147

A. Kläser, M. Marszalek, C. Schmid, and A. Zisserman, Human Focused Action Localization in Video, International Workshop on Sign, Gesture, and Activity (SGA), 2010.
DOI : 10.1007/978-3-642-35749-7_17

T. Lan, Y. Wang, and G. Mori, Discriminative figure-centric models for joint action localization and recognition, ICCV, 2011.

I. Laptev and P. Pérez, Retrieving actions in movies, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4409105

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.80.1618

M. M. Puscas, E. Sangineto, D. Culibrk, and N. Sebe, Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.193

D. Oneata, J. Revaud, J. Verbeek, and C. Schmid, Spatio-temporal Object Detection Proposals, ECCV, 2014.
DOI : 10.1007/978-3-319-10578-9_48

URL : https://hal.archives-ouvertes.fr/hal-01021902

X. Peng and C. Schmid, Multi-region Two-Stream R-CNN for Action Detection, ECCV, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01349107

S. Ren, K. He, R. Girshick, and J. Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, NIPS, 2015.
DOI : 10.1109/TPAMI.2016.2577031

URL : http://arxiv.org/abs/1506.01497

G. Rogez, P. Weinzaepfel, and C. Schmid, LCR-Net: Localization-Classification-Regression for Human Pose, CVPR, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01505085

S. Saha, G. Singh, M. Sapienza, P. Torr, and F. Cuzzolin, Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos, Proceedings of the British Machine Vision Conference 2016, 2016.
DOI : 10.5244/C.30.58

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, ICLR, 2014.

J. van Gemert, M. Jain, E. Gati, and C. Snoek, APT: Action localization proposals from dense trajectories, Proceedings of the British Machine Vision Conference 2015, 2015.
DOI : 10.5244/C.29.177

P. Weinzaepfel, Z. Harchaoui, and C. Schmid, Learning to Track for Spatio-Temporal Action Localization, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.362

URL : https://hal.archives-ouvertes.fr/hal-01159941

P. Weinzaepfel, X. Martin, and C. Schmid, Human action localization with sparse spatial supervision, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01317558

G. Yu and J. Yuan, Fast action proposals for human action detection and search, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298735

J. Yuan, Z. Liu, and Y. Wu, Discriminative subvolume search for efficient action detection, CVPR, 2009.