J. Davis, Hierarchical motion history images for recognizing human motion, Proceedings IEEE Workshop on Detection and Recognition of Events in Video, 2001.
DOI : 10.1109/EVENT.2001.938864

M. Ahad, J. Tan, H. Kim, and S. Ishikawa, Motion history image: its variants and applications, Machine Vision and Applications, 2010.
DOI : 10.1007/s00138-010-0298-4

J. K. Aggarwal and Q. Cai, Human motion analysis: a review, CVIU, 1999.

T. S. Kim and Z. Uddin, Silhouette-based Human Activity Recognition Using Independent Component Analysis, Linear Discriminant Analysis and Hidden Markov Model, InTech, 2010.
DOI : 10.5772/7614

Z. Lin, Z. Jiang, and L. S. Davis, Recognizing actions by shape-motion prototype trees, 2009.

R. Messing, C. Pal, and H. Kautz, Activity recognition using the velocity histories of tracked keypoints, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459154

M. Raptis and S. Soatto, Tracklet Descriptors for Action Modeling and Video Analysis, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15549-9_42

M. B. Kaaniche and F. Bremond, Gesture recognition by learning local motion signatures, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539999
URL : https://hal.archives-ouvertes.fr/inria-00486110

H. Wang, A. Klaser, C. Schmid, and C.-L. Liu, Action recognition by dense trajectories, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995407
URL : https://hal.archives-ouvertes.fr/inria-00583818

I. Laptev, On space-time interest points, IJCV, 2005.
DOI : 10.1007/s11263-005-1838-7
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.1419

K. Rapantzikos, Y. Avrithis, and S. Kollias, Dense saliency-based spatiotemporal feature points for action recognition, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206525

A. Klaser, M. Marszalek, and C. Schmid, A Spatio-Temporal Descriptor Based on 3D-Gradients, Proceedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.99
URL : https://hal.archives-ouvertes.fr/inria-00514853

A. Gilbert, J. Illingworth, and R. Bowden, Fast realistic multi-action recognition using mined dense spatio-temporal features, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459335
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.158.3113

J. Liu and M. Shah, Learning human actions via information maximization, In: CVPR, 2008.

P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005.
DOI : 10.1109/VSPETS.2005.1570899
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.77.5712

G. Willems, T. Tuytelaars, and L. Van Gool, An efficient dense and scale-invariant spatiotemporal interest point detector, In: ECCV, 2008.
DOI : 10.1007/978-3-540-88688-4_48

H. Wang, M. M. Ullah, A. Klaser, I. Laptev, and C. Schmid, Evaluation of local spatio-temporal features for action recognition, Proceedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.124
URL : https://hal.archives-ouvertes.fr/inria-00439769

I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659

A. Gupta and L. S. Davis, Objects in Action: An Approach for Combining Action Understanding and Object Perception, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383331

L. J. Li and L. Fei-Fei, What, where and who? Classifying events by scene and object recognition, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408872

M. Marszalek, I. Laptev, and C. Schmid, Actions in context, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206557
URL : https://hal.archives-ouvertes.fr/inria-00548645

J. Sun, X. Wu, S. Yan, L. F. Cheong, T. S. Chua et al., Hierarchical spatiotemporal context modeling for action recognition, In: CVPR, 2009.

J. Wang, Z. Chen, and Y. Wu, Action recognition with multiscale spatio-temporal contexts, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995493

P. Banerjee and R. Nevatia, Learning neighborhood co-occurrence statistics of sparse features for human activity recognition, In: AVSS, 2011.

A. Kovashka and K. Grauman, Learning a hierarchy of discriminative space-time neighborhood features for human action recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539881

A. Oikonomopoulos, I. Patras, and M. Pantic, An implicit spatiotemporal shape model for human activity localisation and recognition, Human Communicative Behaviour Analysis, 2009.

A. P. Ta, C. Wolf, G. Lavoue, A. Baskurt, and J. M. Jolion, Pairwise Features for Human Action Recognition, 2010 20th International Conference on Pattern Recognition, 2010.
DOI : 10.1109/ICPR.2010.788
URL : https://hal.archives-ouvertes.fr/hal-01381471

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004), 2004.
DOI : 10.1109/ICPR.2004.1334462

J. Liu, J. Luo, and M. Shah, Recognizing realistic actions from videos "in the wild", 2009.

X. Wu, D. Xu, L. Duan, and J. Luo, Action recognition using context and appearance distribution features, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995624

T. K. Kim, S. F. Wong, and R. Cipolla, Tensor Canonical Correlation Analysis for Action Classification, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383137

S. Wu, O. Oreifej, and M. Shah, Action recognition in videos acquired by a moving camera using motion decomposition of Lagrangian particle trajectories, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126397

Z. Jiang, Z. Lin, and L. Davis, Recognizing human actions by learning and matching shape-motion prototype trees, PAMI, 2011.