Quo vadis, action recognition? A new model and the kinetics dataset, CVPR, 2017. ,
Potion: Pose motion representation for action recognition, CVPR, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01764222
Visualizing higher-layer features of a deep network, 2009. ,
, Jitendra Malik, and Kaiming He. SlowFast Networks for Video Recognition, 2018.
Attentional pooling for action recognition, NIPS, 2017. ,
, Video Action Transformer Network, 2018.
Something Something" video database for learning and evaluating visual common sense, ICCV, 2017. ,
Deep residual learning for image recognition, CVPR, 2016. ,
ActivityNet: A large-scale video benchmark for human activity understanding, CVPR, 2015. ,
Spatial transformer networks, NIPS, 2015. ,
Processing Megapixel Images with Deep Attention-Sampling Models, ICML, 2019. ,
VideoLSTM convolves, attends and flows for action recognition. Computer Vision and Image Understanding, 2018. ,
Temporal shift module for efficient video understanding, 2018. ,
Attention clusters: Purely attention based local feature integration for video classification, CVPR, 2018. ,
Moments in time dataset: one million videos for event understanding, 2019. ,
Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization, ICCV, 2017. ,
Action recognition using visual attention, ICLR (workshop track), 2016. ,
, , 2018.
Hollywood in homes: Crowdsourcing data collection for activity understanding, ECCV, 2016. ,
Asynchronous temporal fields for action recognition, CVPR, 2017. ,
Charades-ego: A large-scale dataset of paired third and first person videos, 2018. ,
Two-stream convolutional networks for action recognition in videos, NIPS, 2014. ,
Very Deep Convolutional Networks for Large-Scale Image Recognition, ICLR, 2015. ,
, Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps, 2013.
Striving for simplicity: The all convolutional net, ICLR (workshop track), 2015. ,
Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition, BMVC, 2018. ,
Actor-centric relation network, ECCV, 2018. ,
Learning spatiotemporal features with 3d convolutional networks, ICCV, 2015. ,
Long-term temporal convolutions for action recognition, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01241518
Attention Is All You Need, NIPS, 2017. ,
Temporal segment networks: Towards good practices for deep action recognition, ECCV, 2016. ,
Temporal segment networks for action recognition in videos, 2018. ,
Videos as space-time region graphs, ECCV, 2018. ,
Abhinav Gupta, and Kaiming He. Non-local neural networks, CVPR, 2018. ,
Eidetic 3D LSTM: A Model for Video Prediction and Beyond, ICLR, 2019. ,
Long-Term Feature Banks for Detailed Video Understanding, CVPR, 2019. ,
End-to-end learning of action detection from frame glimpses in videos, CVPR, 2016. ,
Beyond short snippets: Deep networks for video classification, CVPR, 2015. ,
Visualizing and understanding convolutional networks, ECCV, 2014. ,
Two-Stream Oriented Video SuperResolution for Action Recognition, 2019. ,
Learning Deep Features for Discriminative Localization, CVPR, 2016. ,
Temporal relational reasoning in videos, ECCV, 2018. ,
Chained multi-stream networks exploiting pose, motion, and appearance for action classification and detection, ICCV, 2017. ,