High accuracy optical flow estimation based on a theory for warping, ECCV, 2004. ,
Quo vadis, action recognition? a new model and the kinetics dataset, CVPR, 2008. ,
Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Trans. PAMI, issue.2, 2018. ,
Efficient two-stream motion and appearance 3D CNNs for video classification, ECCV workshop, 2016. ,
Long-term recurrent convolutional networks for visual recognition and description, CVPR, 2015. ,
Flownet: Learning optical flow with convolutional networks, ICCV, 2015. ,
End-to-end learning of motion representation for video understanding, CVPR, 2018. ,
Convolutional two-stream network fusion for video action recognition, CVPR, 2016. ,
Modality distillation with multiple stream networks for action recognition, ECCV, 2018. ,
The something something video database for learning and evaluating visual common sense, ICCV, 2005. ,
Can spatiotemporal 3D CNNs retrace the history of 2D CNNs and ImageNet, CVPR, vol.3, 2018. ,
Piotr Dollár, and Ross Girshick. Mask R-CNN, ICCV, 2017. ,
Deep residual learning for image recognition, CVPR, vol.1, 2016. ,
Distilling the knowledge in a neural network, NIPS workshop, vol.2, p.3, 2014. ,
Learning with side information through modality hallucination, CVPR, 2016. ,
Flownet 2.0: Evolution of optical flow estimation with deep networks, CVPR, 2017. ,
Efficient feature extraction, encoding and classification for action recognition, CVPR, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01058734
, The kinetics human action video dataset, vol.1, p.4, 2017.
ImageNet classification with deep convolutional neural networks, NIPS, vol.1, 2012. ,
HMDB: A large video database for human motion recognition, ICCV, vol.3, p.4, 2011. ,
Motion feature network: Fixed motion filter for action recognition, ECCV, vol.3, 2018. ,
RESOUND: Towards action recognition without representation bias, ECCV, 2018. ,
Fully convolutional networks for semantic segmentation, CVPR, 2015. ,
Unifying distillation and privileged information, ICLR, 2016. ,
Graph distillation for action detection with privileged modalities, ECCV, 2018. ,
ActionFlowNet: Learning motion representation for action recognition, WACV, 2018. ,
Optical flow estimation using a spatial pyramid network, CVPR, 2017. ,
Faster R-CNN: Towards real-time object detection with region proposal networks, NIPS, 2015. ,
Epicflow: Edge-preserving interpolation of correspondences for optical flow, CVPR, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01097477
On the integration of optical flow and action recognition, GCPR, 2018. ,
CNN features off-the-shelf: An astounding baseline for recognition, CVPR workshops, 2014. ,
Two-stream convolutional networks for action recognition in videos, NIPS, 2008. ,
UCF101: A dataset of 101 human actions classes from videos in the wild, vol.3, p.4, 2012. ,
PWC-Net: CNNs for optical flow using pyramid, warping, and cost volume, CVPR, vol.1, p.5, 2018. ,
Optical flow guided feature: a fast and robust motion representation for video action recognition, CVPR, vol.3, 2018. ,
Going deeper with convolutions, CVPR, 2015. ,
Learning spatiotemporal features with 3D convolutional networks, ICCV, 2008. ,
A closer look at spatiotemporal convolutions for action recognition, CVPR, 2008. ,
Learning using privileged information: Similarity control and knowledge transfer, JMLR, vol.2, p.4, 2015. ,
Long-term temporal convolutions for action recognition, IEEE Trans. PAMI, issue.6, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01241518
Temporal segment networks: Towards good practices for deep action recognition, ECCV, 2016. ,
Abhinav Gupta, and Kaiming He. Non-local neural networks, CVPR, vol.7, 2018. ,
Aggregated residual transformations for deep neural networks, CVPR, 2017. ,
Rethinking spatiotemporal feature learning: Speed-accuracy trade-offs in video classification, ECCV, 2008. ,
A duality based approach for realtime TV-L1 optical flow, Joint Pattern Recognition Symposium, vol.5, p.6, 2003. ,
Visualizing and understanding convolutional networks, ECCV, 2014. ,
Temporal relational reasoning in videos, ECCV, 2018. ,
Learning deep features for discriminative localization, CVPR, 2016. ,
Hidden two-stream convolutional networks for action recognition, ACCV, 2018. ,