On bodies and events, The Imitative Mind, 2002. ,
DOI : 10.1017/CBO9780511489969.013
Learning realistic human actions from movies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008. ,
DOI : 10.1109/CVPR.2008.4587756
URL : https://hal.archives-ouvertes.fr/inria-00548659
Unsupervised learning of human action categories using spatial-temporal words, pp.299-318, 2008. ,
Recognizing human actions: a local SVM approach, ICPR, 2004. ,
Action Recognition with Improved Trajectories, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.441
URL : https://hal.archives-ouvertes.fr/hal-00873267
Two-stream convolutional networks for action recognition in videos, NIPS, 2014. ,
ImageNet classification with deep convolutional neural networks, NIPS, 2012. ,
DOI : 10.1162/neco.2009.10-08-881
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.299.205
ImageNet: A large-scale hierarchical image database, CVPR, 2009. ,
Learning deep features for scene recognition using places database, NIPS, 2014. ,
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.81
URL : http://arxiv.org/abs/1311.2524
DeepFace: Closing the Gap to Human-Level Performance in Face Verification, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.220
Large-Scale Video Classification with Convolutional Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.223
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.471.3312
Learning Spatiotemporal Features with 3D Convolutional Networks, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.510
URL : http://arxiv.org/abs/1412.0767
Long-term recurrent convolutional networks for visual recognition and description, CVPR, 2015. ,
DOI : 10.1109/tpami.2016.2599174
URL : http://arxiv.org/abs/1411.4389
3D Convolutional Neural Networks for Human Action Recognition, ICML, 2010. ,
DOI : 10.1109/TPAMI.2012.59
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.169.4046
Convolutional Learning of Spatio-temporal Features, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15567-3_11
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.178.9267
Visual categorization with bags of keypoints, ECCVW, 2004. ,
Improving the Fisher Kernel for Large-Scale Image Classification, ECCV, 2010. ,
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630
Modeling video evolution for action recognition, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7299176
Backpropagation Applied to Handwritten Zip Code Recognition, Neural Computation, vol.1, issue.4, pp.541-551, 1989. ,
DOI : 10.1007/BF00133697
Action recognition with trajectorypooled deep-convolutional descriptors, CVPR, 2015. ,
DOI : 10.1109/cvpr.2015.7299059
URL : http://arxiv.org/abs/1505.04868
Towards good practices for very deep two-stream convnets, 2015. ,
DOI : 10.1007/978-3-319-46484-8_2
URL : http://arxiv.org/abs/1608.00859
Dynamic Image Networks for Action Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
DOI : 10.1109/CVPR.2016.331
Convolutional twostream network fusion for video action recognition, CVPR, 2016. ,
DOI : 10.1109/cvpr.2016.213
URL : http://arxiv.org/abs/1604.06573
Real-Time Action Recognition with Enhanced Motion Vector CNNs, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
DOI : 10.1109/CVPR.2016.297
URL : http://arxiv.org/abs/1604.07669
Efficient Feature Extraction, Encoding, and Classification for Action Recognition, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014. ,
DOI : 10.1109/CVPR.2014.332
URL : https://hal.archives-ouvertes.fr/hal-01058734
Two-Frame Motion Estimation Based on Polynomial Expansion, SCIA, 2003. ,
DOI : 10.1007/3-540-45103-X_50
High Accuracy Optical Flow Estimation Based on a Theory for Warping, ECCV, 2004. ,
DOI : 10.1007/978-3-540-24673-2_3
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.1732
UCF101: A dataset of 101 human actions classes from videos in the wild, 2012. ,
HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126543
URL : http://cbcl.mit.edu/publications/ps/Kuehne_etal_iccv11.pdf
Beyond Gaussian pyramid: Multi-skip feature stacking for action recognition, CVPR, 2015. ,
Beyond short snippets: Deep networks for video classification, CVPR, 2015. ,
Visualizing and Understanding Convolutional Networks, ECCV, 2014. ,
DOI : 10.1007/978-3-319-10590-1_53
URL : http://arxiv.org/abs/1311.2901