M. Bregonzio, S. Gong, and T. Xiang, Recognising action as clouds of space-time interest points, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206779

T. Brox and J. Malik, Object Segmentation by Long Term Analysis of Point Trajectories, ECCV, 2010.
DOI : 10.1007/978-3-642-15555-0_21

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented Histograms of Flow and Appearance, ECCV, 2006.
DOI : 10.1007/11744047_33

URL : https://hal.archives-ouvertes.fr/inria-00548587

P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie, Behavior Recognition via Sparse Spatio-Temporal Features, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005.
DOI : 10.1109/VSPETS.2005.1570899

G. Farnebäck, Two-Frame Motion Estimation Based on Polynomial Expansion, Scandinavian Conference on Image Analysis, 2003.
DOI : 10.1007/3-540-45103-X_50

L. Fei-Fei and P. Perona, A Bayesian Hierarchical Model for Learning Natural Scene Categories, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.16

A. Gilbert, J. Illingworth, and R. Bowden, Action Recognition Using Mined Hierarchical Compound Features, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.5, 2011.
DOI : 10.1109/TPAMI.2010.144

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.301.1835

N. Ikizler-Cinbis and S. Sclaroff, Object, Scene and Actions: Combining Multiple Features for Human Action Recognition, ECCV, 2010.
DOI : 10.1007/978-3-642-15549-9_36

A. Kläser, M. Marszałek, I. Laptev, and C. Schmid, Will person detection help bag-of-features action recognition?, 2010.

A. Kläser, M. Marszałek, and C. Schmid, A Spatio-Temporal Descriptor Based on 3D-Gradients, Proceedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.99

A. Kovashka and K. Grauman, Learning a hierarchy of discriminative space-time neighborhood features for human action recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539881

I. Laptev and T. Lindeberg, Space-time interest points, ICCV, 2003.
DOI : 10.1109/iccv.2003.1238378

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.4359

I. Laptev, M. Marszałek, C. Schmid, and B. Rozenfeld, Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587756

URL : https://hal.archives-ouvertes.fr/inria-00548659

C. Liu, J. Yuen, and A. Torralba, Nonparametric scene parsing: label transfer via dense scene alignment, CVPR, 2009.

J. Liu, J. Luo, and M. Shah, Recognizing realistic actions from videos in the wild, CVPR, 2009.

W.-C. Lu, Y.-C. F. Wang, and C.-S. Chen, Learning dense optical-flow trajectory patterns for video object extraction, IEEE Conference on Advanced Video and Signal Based Surveillance, 2010.

B. D. Lucas and T. Kanade, An iterative image registration technique with an application to stereo vision, International Joint Conference on Artificial Intelligence, 1981.

M. Marszałek, I. Laptev, and C. Schmid, Actions in context, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206557

P. Matikainen, M. Hebert, and R. Sukthankar, Trajectons: Action recognition through the motion analysis of tracked features, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009.
DOI : 10.1109/ICCVW.2009.5457659

R. Messing, C. Pal, and H. Kautz, Activity recognition using the velocity histories of tracked keypoints, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459154

E. Nowak, F. Jurie, and B. Triggs, Sampling Strategies for Bag-of-Features Image Classification, ECCV, 2006.
DOI : 10.1007/11744085_38

URL : https://hal.archives-ouvertes.fr/hal-00203752

M. Rodriguez, J. Ahmed, and M. Shah, Action MACH: a spatio-temporal Maximum Average Correlation Height filter for action recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587727

C. Schüldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004), 2004.
DOI : 10.1109/ICPR.2004.1334462

P. Scovanner, S. Ali, and M. Shah, A 3-dimensional SIFT descriptor and its application to action recognition, Proceedings of the 15th International Conference on Multimedia (MULTIMEDIA '07), 2007.
DOI : 10.1145/1291233.1291311

J. Shi and C. Tomasi, Good features to track, CVPR, 1994.

J. Sun, X. Wu, S. Yan, L. Cheong, T. Chua et al., Hierarchical spatio-temporal context modeling for action recognition, CVPR, 2009.

N. Sundaram, T. Brox, and K. Keutzer, Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow, ECCV, 2010.
DOI : 10.1007/978-3-642-15549-9_32

G. W. Taylor, R. Fergus, Y. LeCun, and C. Bregler, Convolutional Learning of Spatio-temporal Features, ECCV, 2010.
DOI : 10.1007/978-3-642-15567-3_11

H. Uemura, S. Ishikawa, and K. Mikolajczyk, Feature Tracking and Motion Compensation for Action Recognition, Proceedings of the British Machine Vision Conference 2008, 2008.
DOI : 10.5244/C.22.30

M. M. Ullah, S. N. Parizi, and I. Laptev, Improving bag-of-features action recognition with non-local cues, BMVC, 2010.

H. Wang, M. M. Ullah, A. Kläser, I. Laptev, and C. Schmid, Evaluation of local spatio-temporal features for action recognition, Proceedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.124

URL : https://hal.archives-ouvertes.fr/inria-00439769

G. Willems, T. Tuytelaars, and L. Van Gool, An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector, ECCV, 2008.
DOI : 10.1007/978-3-540-88688-4_48

L. Yeffet and L. Wolf, Local Trinary Patterns for human action recognition, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459201

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.9905

J. Yuan, Z. Liu, and Y. Wu, Discriminative subvolume search for efficient action detection, CVPR, 2009.

J. Zhang, M. Marszałek, S. Lazebnik, and C. Schmid, Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study, International Journal of Computer Vision, vol.73, issue.2, pp.213-238, 2007.
DOI : 10.1007/s11263-006-9794-4

URL : https://hal.archives-ouvertes.fr/inria-00548574