Actions as space-time shapes, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1395-1402, 2005. ,
DOI : 10.1109/ICCV.2005.28
Hierarchical matching pursuit for image classification: Architecture and fast algorithms, Advances in Neural Information Processing Systems, pp.6-39, 2011. ,
Finding Actors and Actions in Movies, 2013 IEEE International Conference on Computer Vision, p.90, 2013. ,
DOI : 10.1109/ICCV.2013.283
URL : https://hal.archives-ouvertes.fr/hal-00904991
Weakly Supervised Action Labeling in Videos under Ordering Constraints, Proceedings of the European Conference on Computer Vision, pp.628-643, 2014. ,
DOI : 10.1007/978-3-319-10602-1_41
URL : https://hal.archives-ouvertes.fr/hal-01053967
Integrating structured biological data by Kernel Maximum Mean Discrepancy, Bioinformatics, p.76, 2006. ,
DOI : 10.1093/bioinformatics/btl242
URL : https://academic.oup.com/bioinformatics/article-pdf/22/14/e49/616383/btl242.pdf
Unsupervised pixel-level domain adaptation with generative adversarial networks. arXiv preprint, p.42, 2016. ,
DOI : 10.1109/cvpr.2017.18
Domain separation networks, Advances in Neural Information Processing Systems, pp.343-351, 2016. ,
Shadow puppetry, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1237-1244, 1999. ,
DOI : 10.1109/ICCV.1999.790422
High Accuracy Optical Flow Estimation Based on a Theory for Warping, Proceedings of the European Conference on Computer Vision, p.127, 2004. ,
DOI : 10.1007/978-3-540-24673-2_3
Object Segmentation by Long Term Analysis of Point Trajectories, Proceedings of the European Conference on Computer Vision, p.66, 2010. ,
DOI : 10.1007/978-3-642-15555-0_21
Actionness Ranking with Lattice Conditional Ordinal Random Fields, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.121, 2014. ,
DOI : 10.1109/CVPR.2014.101
URL : http://web.eecs.umich.edu/%7Ejjcorso/pubs/jcorso_CVPR2014_actionness.pdf
Semi-supervised Learning of Facial Attributes in Video, Proceedings of the European Conference on Computer Vision, pp.43-56, 2010. ,
DOI : 10.1007/978-3-642-35749-7_4
Dlid: Deep learning for domain adaptation by interpolating between domains, ICML 2013, Workshop on Representation Learning, p.38, 2013. ,
Multi-fold MIL Training for Weakly Supervised Object Localization, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.148, 2014. ,
DOI : 10.1109/CVPR.2014.309
URL : https://hal.archives-ouvertes.fr/hal-00975746
Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.1, p.89, 2016. ,
DOI : 10.1109/TPAMI.2016.2535231
URL : https://hal.archives-ouvertes.fr/hal-01123482
Visual categorization with bags of keypoints, Workshop on statistical learning in computer vision, ECCV, pp.1-2, 2004. ,
Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.25-26, 2005. ,
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512
Frustratingly easy domain adaptation. arXiv Preprint, p.36, 2009. ,
Behavior discovery and alignment of articulated object classes from unstructured video, International Journal of Computer Vision, pp.1-23, 2016. ,
Discovering the physical parts of an articulated object class from multiple videos, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016. ,
Scalable Object Detection Using Deep Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.22, 2014. ,
DOI : 10.1109/CVPR.2014.276
URL : http://www.cv-foundation.org/openaccess/content_cvpr_2014/papers/Erhan_Scalable_Object_Detection_2014_CVPR_paper.pdf
DAPs: Deep Action Proposals for Action Understanding, Proceedings of the European Conference on Computer Vision, pp.768-784, 2016. ,
DOI : 10.1007/978-3-319-10602-1_26
On the relationship between visual attributes and convolutional networks, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.90, 2015. ,
DOI : 10.1109/CVPR.2015.7298730
The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.88, issue.2, p.96, 2010. ,
DOI : 10.1007/s11263-009-0275-4
The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, pp.61-62, 2007. ,
DOI : 10.1371/journal.pcbi.0040027
Learning to Recognize Activities from the Wrong View Point, Proceedings of the European Conference on Computer Vision, pp.154-166, 2008. ,
DOI : 10.1145/1273496.1273637
Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.64-73, 2010. ,
DOI : 10.1109/TPAMI.2009.167
Pictorial Structures for Object Recognition, International Journal of Computer Vision, vol.61, issue.1, pp.55-79, 2005. ,
DOI : 10.1023/B:VISI.0000042934.15159.49
Unsupervised Visual Domain Adaptation Using Subspace Alignment, 2013 IEEE International Conference on Computer Vision, pp.2960-2967, 2013. ,
DOI : 10.1109/ICCV.2013.368
URL : https://hal.archives-ouvertes.fr/hal-00869417
Recognizing primitive interactions by exploring actor-object states, 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2008. ,
DOI : 10.1109/CVPR.2008.4587726
URL : http://cs.fit.edu/~eribeiro/papers/FilipovychRibeiro_cvpr2008b.pdf
Robust sequence alignment for actor-object interaction recognition: Discovering actor-object states, Computer Vision and Image Understanding, vol.115, issue.2, pp.177-193, 2011. ,
DOI : 10.1016/j.cviu.2010.11.012
Temporal Localization of Actions with Actoms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.11, pp.2782-2795, 2013. ,
DOI : 10.1109/TPAMI.2013.65
URL : https://hal.archives-ouvertes.fr/hal-00687312
Online Domain Adaptation for Multi-Object Tracking, Proceedings of the British Machine Vision Conference 2015, p.61, 2015. ,
DOI : 10.5244/C.29.3
Self-learning camera: Autonomous adaptation of object detectors to unlabeled video streams. arXiv preprint, p.61, 2014. ,
Unsupervised domain adaptation by backpropagation, International Conference on Machine Learning, pp.1180-1189, 2015. ,
APT: Action localization proposals from dense trajectories, Proceedings of the British Machine Vision Conference 2015, p.121, 2015. ,
DOI : 10.5244/C.29.177
Domain Adaptive Neural Networks for Object Recognition, Pacific Rim International Conference on Artificial Intelligence, pp.898-904, 2014. ,
DOI : 10.1007/978-3-319-13560-1_76
URL : http://arxiv.org/pdf/1409.6041.pdf
Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation, Proceedings of the European Conference on Computer Vision, pp.597-613, 2016. ,
DOI : 10.2307/1912526
Object Detection via a Multi-region and Semantic Segmentation-Aware CNN Model, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1134-1142, 2015. ,
DOI : 10.1109/ICCV.2015.135
Fast R-CNN, 2015 IEEE International Conference on Computer Vision (ICCV), pp.27-28, 2015. ,
DOI : 10.1109/ICCV.2015.169
Fast R-CNN. https://github.com/rbgirshick/fast-rcnn, p.27, 2015. ,
DOI : 10.1109/iccv.2015.169
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.61-63, 2014. ,
DOI : 10.1109/CVPR.2014.81
URL : http://arxiv.org/pdf/1311.2524
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.65, 2014. ,
DOI : 10.1109/CVPR.2014.81
URL : http://arxiv.org/pdf/1311.2524
Discriminatively trained deformable part models, release 5, pp.61-65, 2012. ,
Finding action tubes, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.121-127, 2015. ,
DOI : 10.1109/CVPR.2015.7298676
Geodesic flow kernel for unsupervised domain adaptation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.2066-2073, 2012. ,
Do semantic parts emerge in convolutional neural networks? arXiv Preprint, p.23, 2016. ,
DOI : 10.1007/s11263-017-1048-0
Objects as context for part detection, 2017. ,
An active search strategy for efficient object class detection, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3022-3031, 2015. ,
DOI : 10.1109/CVPR.2015.7298921
URL : http://arxiv.org/abs/1412.3709
Generative adversarial nets, Advances in Neural Information Processing Systems, pp.2672-2680, 2014. ,
Domain adaptation for object recognition: An unsupervised approach, 2011 International Conference on Computer Vision, pp.999-1006, 2011. ,
DOI : 10.1109/ICCV.2011.6126344
URL : http://www.umiacs.umd.edu/~raghuram/Publications/2011_ICCV_DomainAdaptation.pdf
Unsupervised Adaptation Across Domain Shifts by Generating Intermediate Data Representations, IEEE Transactions on Pattern Analysis and Machine Intelligence, p.56, 2014. ,
DOI : 10.1109/TPAMI.2013.249
URL : http://www.research.att.com/export/sites/att_labs/techdocs/TD_101340.pdf
Efficient hierarchical graph-based video segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.100, 2010. ,
DOI : 10.1109/cvpr.2010.5539893
URL : http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/36247.pdf
Recognition using regions, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1030-1037, 2009. ,
Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.10, p.90, 2009. ,
DOI : 10.1109/TPAMI.2009.83
Aligning 3d models to rgbd images of cluttered scenes, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.4731-4740, 2015. ,
Learning Rich Features from RGB-D Images for Object Detection and Segmentation, Proceedings of the European Conference on Computer Vision, pp.345-360, 2014. ,
DOI : 10.1007/978-3-319-10584-0_23
URL : http://arxiv.org/pdf/1407.5736
Cross Modal Distillation for Supervision Transfer, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.2827-2836, 2016. ,
DOI : 10.1109/CVPR.2016.309
Weakly Supervised Learning of Object Segmentations from Web-Scale Video, ECCV, Workshops and Demonstrations, pp.198-208, 2012. ,
DOI : 10.1007/978-3-642-33863-2_20
URL : http://www.cs.cmu.edu/~rahuls/pub/eccv2012wk-cp-rahuls.pdf
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.37, issue.9, pp.1904-1916, 2015. ,
DOI : 10.1109/TPAMI.2015.2389824
Deep Residual Learning for Image Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016. ,
DOI : 10.1109/CVPR.2016.90
URL : http://arxiv.org/pdf/1512.03385
ActivityNet: A large-scale video benchmark for human activity understanding, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.149, 2015. ,
DOI : 10.1109/CVPR.2015.7298698
End-to-end training of object class detectors for mean average precision. arXiv Preprint, p.22, 2016. ,
Learning discriminative localization from weakly labeled data, Pattern Recognition, vol.47, issue.3, pp.1523-1534, 2014. ,
DOI : 10.1016/j.patcog.2013.09.028
Learning with Side Information through Modality Hallucination, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.826-834, 2016. ,
DOI : 10.1109/CVPR.2016.96
Efficient learning of domain-invariant image representations, International Conference on Learning Representations, p.37, 2013. ,
Asymmetric and Category Invariant Feature Transformations for Domain Adaptation, International Journal of Computer Vision, vol.39, issue.12, p.75, 2014. ,
DOI : 10.1109/TPAMI.2009.151
One-shot adaptation of supervised deep convolutional models, arXiv Preprint, p.39, 2014. ,
Model-based vision: a program to see a walking person, Image and Vision Computing, vol.1, issue.1, pp.5-20, 1983. ,
DOI : 10.1016/0262-8856(83)90003-3
Tube convolutional neural network (t-cnn) for action detection in videos. arXiv Preprint, p.49, 2017. ,
DOI : 10.1109/iccv.2017.620
A Survey on Visual Surveillance of Object Motion and Behaviors, IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol.34, issue.3, p.14, 2004. ,
DOI : 10.1109/TSMCC.2004.829274
Speed/accuracy trade-offs for modern convolutional object detectors. arXiv Preprint, p.123, 2016. ,
DOI : 10.1109/cvpr.2017.351
Densebox: Unifying landmark localization with end to end object detection. arXiv preprint, 2015. ,
Do motion boundaries improve semantic segmentation?, p.147, 2016. ,
DOI : 10.1002/lnc3.357
Action Localization with Tubelets from Motion, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.740-747, 2014. ,
DOI : 10.1109/CVPR.2014.100
URL : https://hal.archives-ouvertes.fr/hal-00996844
Separating non-stationary from stationary scene components in a sequence of real world TV-images, 1977. ,
Towards Understanding Action Recognition, 2013 IEEE International Conference on Computer Vision, pp.52-129, 2013. ,
DOI : 10.1109/ICCV.2013.396
URL : https://hal.archives-ouvertes.fr/hal-00906902
Towards Understanding Action Recognition, 2013 IEEE International Conference on Computer Vision, pp.3192-3199, 2013. ,
DOI : 10.1109/ICCV.2013.396
URL : https://hal.archives-ouvertes.fr/hal-00906902
Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, p.64, 2013. ,
DOI : 10.1145/2647868.2654889
Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, p.76, 2014. ,
DOI : 10.1145/2647868.2654889
A literature survey on domain adaptation of statistical classifiers, 2008. ,
Analysing Domain Shift Factors between Videos and Images for Object Detection, IEEE Transactions on Pattern Analysis and Machine Intelligence. xi, xii, pp.51-56, 2016. ,
DOI : 10.1109/TPAMI.2016.2551239
URL : https://hal.archives-ouvertes.fr/hal-01281069
Action Tubelet Detector for Spatio-Temporal Action Localization, 2017 IEEE International Conference on Computer Vision (ICCV), p.118, 2017. ,
DOI : 10.1109/ICCV.2017.472
URL : https://hal.archives-ouvertes.fr/hal-01519812
Joint Learning of Object and Action Detectors, 2017 IEEE International Conference on Computer Vision (ICCV), p.86, 2017. ,
DOI : 10.1109/ICCV.2017.219
URL : https://hal.archives-ouvertes.fr/hal-01575804
Multi-View Discriminant Analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.1, pp.188-194, 2016. ,
DOI : 10.1109/TPAMI.2015.2435740
URL : http://figment.cse.usf.edu/~sfefilat/data/papers/WeBT4.3.pdf
Object Detection in Videos with Tubelet Proposal Networks, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.50, 2017. ,
DOI : 10.1109/CVPR.2017.101
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.50, 2016. ,
DOI : 10.1109/TCSVT.2017.2736553
Object Detection from Video Tubelets with Convolutional Neural Networks, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.89, 2016. ,
DOI : 10.1109/CVPR.2016.95
URL : http://arxiv.org/pdf/1604.04053
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization, Proceedings of the European Conference on Computer Vision, pp.350-365, 2016. ,
DOI : 10.1007/s11263-009-0275-4
URL : https://hal.archives-ouvertes.fr/hal-01421772
Large-Scale Video Classification with Convolutional Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, p.86, 2014. ,
DOI : 10.1109/CVPR.2014.223
URL : http://www.cs.cmu.edu/~rahuls/pub/cvpr2014-deepvideo-rahuls.pdf
Joint summarization of large sets of web images and videos for storyline reconstruction, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.56, 2014. ,
A Spatio-Temporal Descriptor Based on 3D-Gradients, Proceedings of the British Machine Vision Conference 2008, pp.275-276, 2008. ,
DOI : 10.5244/C.22.99
URL : https://hal.archives-ouvertes.fr/inria-00514853
Human Focused Action Localization in Video, SGA 2010-International Workshop on Sign, Gesture, and Activity, ECCV 2010 Workshops, pp.219-233, 2010. ,
DOI : 10.1007/978-3-642-35749-7_17
URL : https://hal.archives-ouvertes.fr/inria-00514845
ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems. xii, pp.65-89, 2012. ,
DOI : 10.1145/3065386
URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf
Fast Optical Flow Using Dense Inverse Search, Proceedings of the European Conference on Computer Vision, p.122, 2016. ,
DOI : 10.1109/CVPR.2015.7298704
URL : http://arxiv.org/pdf/1603.03590
HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, pp.2556-2563, 2011. ,
DOI : 10.1109/ICCV.2011.6126543
URL : http://cbcl.mit.edu/publications/ps/Kuehne_etal_iccv11.pdf
What you saw is not what you get: Domain adaptation using asymmetric kernel transforms, CVPR 2011, pp.1785-1792, 2011. ,
DOI : 10.1109/CVPR.2011.5995702
URL : http://people.ee.duke.edu/~lcarin/cvpr_adapt.pdf
Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3548-3556, 2016. ,
DOI : 10.1109/CVPR.2016.386
URL : http://arxiv.org/pdf/1604.05766
DeepBox: Learning Objectness with Convolutional Networks, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2479-2487, 2015. ,
DOI : 10.1109/ICCV.2015.285
URL : http://arxiv.org/pdf/1505.02146
Discriminative virtual views for cross-view action recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.2855-2862, 2012. ,
R-fcn: Object detection via region-based fully convolutional networks, Advances in Neural Information Processing Systems, pp.379-387, 2016. ,
Revisiting batch normalization for practical domain adaptation. arXiv Preprint, p.38, 2016. ,
VideoLSTM convolves, attends and flows for action recognition, arXiv Preprint, p.121, 2016. ,
DOI : 10.1016/j.cviu.2017.10.011
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection, 2015 IEEE International Conference on Computer Vision (ICCV), pp.999-1007, 2015. ,
DOI : 10.1109/ICCV.2015.120
Network in network, International Conference on Learning Representations, p.19, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00737767
Microsoft COCO: Common Objects in Context, Proceedings of the European Conference on Computer Vision, p.141, 2014. ,
DOI : 10.1007/978-3-319-10602-1_48
URL : http://arxiv.org/pdf/1405.0312.pdf
Recognizing human actions by attributes, CVPR 2011, p.99, 2011. ,
DOI : 10.1109/CVPR.2011.5995353
URL : http://web.eecs.umich.edu/~kuipers/papers/Liu-cvpr-11_action_attributes.pdf
Coupled generative adversarial networks, Advances in Neural Information Processing Systems, pp.469-477, 2016. ,
SSD: Single Shot MultiBox Detector, Proceedings of the European Conference on Computer Vision. xiii, pp.31-86, 2016. ,
DOI : 10.1007/978-3-319-46448-0_2
URL : http://arxiv.org/pdf/1512.02325
Learning transferable features with deep adaptation networks, International Conference on Machine Learning, pp.97-105, 2015. ,
DOI : 10.1109/tkde.2016.2554549
Deep transfer learning with joint adaptation networks. arXiv Preprint, p.40, 2016. ,
DOI : 10.1109/iccv.2013.274
URL : http://learn.tsinghua.edu.cn:8080/2011310560/publications/joint-iccv14.pdf
Unsupervised domain adaptation with residual transfer networks, Advances in Neural Information Processing Systems, pp.136-144, 2016. ,
Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999. ,
DOI : 10.1109/ICCV.1999.790410
URL : http://www-inst.cs.berkeley.edu/~cs294-6/fa06/papers/LoweD_Object recognition from local scale-invariant features.pdf
Visual Relationship Detection with Language Priors, Proceedings of the European Conference on Computer Vision, pp.90-104, 2016. ,
DOI : 10.1023/B:VISI.0000029664.99615.94
Action Recognition and Localization by Hierarchical Space-Time Segments, 2013 IEEE International Conference on Computer Vision, p.89, 2013. ,
DOI : 10.1109/ICCV.2013.341
URL : http://cs-people.bu.edu/shugaoma/STSegments/iccv2013_preprint_shugao.pdf
Ensemble of exemplar-SVMs for object detection and beyond, 2011 International Conference on Computer Vision, p.89, 2011. ,
DOI : 10.1109/ICCV.2011.6126229
Prime Object Proposals with Randomized Prim's Algorithm, 2013 IEEE International Conference on Computer Vision, pp.2536-2543, 2013. ,
DOI : 10.1109/ICCV.2013.315
Pathtrack: Fast trajectory annotation with path supervision. arXiv preprint, 2017. ,
DOI : 10.1109/iccv.2017.40
Deep captioning with multimodal recurrent neural networks (m-rnn), International Conference on Learning Representations, p.90, 2015. ,
Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1653-1661, 2015. ,
DOI : 10.1109/ICCV.2015.193
Dynamic scene analysis: The study of moving images, 1977. ,
DOI : 10.21236/ADA042124
Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.6024-6033, 2016. ,
DOI : 10.1109/CVPR.2016.648
URL : http://arxiv.org/pdf/1512.02497
Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition, Proceedings of the European Conference on Computer Vision, pp.842-856, 2012. ,
DOI : 10.1007/978-3-642-33709-3_60
URL : http://sminchisescu.ins.uni-bonn.de/papers/ms12eccv.pdf
Representing Pairwise Spatial and Temporal Relations for Action Recognition, Computer Vision - ECCV, issue.8, pp.508-521, 2010. ,
DOI : 10.1007/978-3-642-15549-9_37
URL : http://www.ri.cmu.edu/pub_files/2010/9/eccv2010pyry.pdf
Activity recognition using the velocity histories of tracked keypoints, 2009 IEEE 12th International Conference on Computer Vision, pp.104-111, 2009. ,
DOI : 10.1109/ICCV.2009.5459154
Spot On: Action Localization from Pointly-Supervised Proposals, Proceedings of the European Conference on Computer Vision, pp.437-453, 2016. ,
DOI : 10.1007/s11263-013-0636-x
Watch and learn: Semi-supervised learning of object detectors from videos, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.3593-3602, 2015. ,
DOI : 10.1109/CVPR.2015.7298982
Unsupervised domain adaptation in the wild: Dealing with asymmetric label sets. arXiv preprint, 2016. ,
Learning semantic part-based models from google images. arXiv Preprint, p.147, 2016. ,
DOI : 10.1109/tpami.2017.2724029
Multimodal deep learning, International Conference on Machine Learning, pp.689-696, 2011. ,
DASH-N: Joint Hierarchical Domain Adaptation and Feature Learning, IEEE Transactions on Image Processing, vol.24, issue.12, pp.5479-5491, 2015. ,
DOI : 10.1109/TIP.2015.2479405
Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification, Proceedings of the European Conference on Computer Vision, pp.392-405, 2010. ,
DOI : 10.1007/978-3-642-15552-9_29
Learning Deconvolution Network for Semantic Segmentation, 2015 IEEE International Conference on Computer Vision (ICCV), p.21, 2015. ,
DOI : 10.1109/ICCV.2015.178
A large-scale benchmark dataset for event recognition in surveillance video, CVPR 2011, pp.3153-3160, 2011. ,
DOI : 10.1109/CVPR.2011.5995586
A large-scale benchmark dataset for event recognition in surveillance video, CVPR 2011, p.14, 2011. ,
DOI : 10.1109/CVPR.2011.5995586
Efficient deep models for monocular road segmentation, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.4885-4891, 2016. ,
DOI : 10.1109/IROS.2016.7759717
Seeing the Objects Behind the Dots: Recognition in Videos from a Moving Camera, International Journal of Computer Vision, vol.3, issue.5, pp.57-71, 2009. ,
DOI : 10.1007/s11263-009-0211-7
Spatio-temporal Object Detection Proposals, Proceedings of the European Conference on Computer Vision, p.121, 2014. ,
DOI : 10.1007/978-3-319-10578-9_48
URL : https://hal.archives-ouvertes.fr/hal-01021902
Efficient Action Localization with Approximately Normalized Fisher Vectors, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.2545-2552, 2014. ,
DOI : 10.1109/CVPR.2014.326
URL : https://hal.archives-ouvertes.fr/hal-00979594
DeepID-Net: Deformable deep convolutional neural networks for object detection, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.2403-2412, 2015. ,
DOI : 10.1109/CVPR.2015.7298854
URL : http://www.ee.cuhk.edu.hk/%7Exgwang/papers/deepIDNetCVPR15.pdf
A Survey on Transfer Learning, IEEE Transactions on Knowledge and Data Engineering, vol.22, issue.10, p.56, 2010. ,
DOI : 10.1109/TKDE.2009.191
Scene recognition and weakly supervised object localization with deformable part-based models, 2011 International Conference on Computer Vision, p.89, 2011. ,
DOI : 10.1109/ICCV.2011.6126383
URL : http://www.cs.unc.edu/~lazebnik/publications/megha_iccv2011.pdf
Training Object Class Detectors from Eye Tracking Data, Proceedings of the European Conference on Computer Vision, pp.361-376, 2014. ,
DOI : 10.1007/978-3-319-10602-1_24
URL : http://groups.inf.ed.ac.uk/calvin/Publications/papadopouloseccv14.pdf
We Don't Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.854-863, 2016. ,
DOI : 10.1109/CVPR.2016.99
URL : http://arxiv.org/pdf/1602.08405
Training Object Class Detectors with Click Supervision, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.148, 2017. ,
DOI : 10.1109/CVPR.2017.27
Discovering object aspects from video, Image and Vision Computing, vol.52, pp.206-217, 2016. ,
DOI : 10.1016/j.imavis.2016.04.014
Video Temporal Alignment for Object Viewpoint, Proceedings of the Asian Conference on Computer Vision, pp.273-288, 2016. ,
DOI : 10.1109/CVPR.2012.6248065
Fast Object Segmentation in Unconstrained Video, 2013 IEEE International Conference on Computer Vision, p.67, 2013. ,
DOI : 10.1109/ICCV.2013.223
Multi-region Two-Stream R-CNN for Action Detection, Proceedings of the European Conference on Computer Vision. xvi, xx, xxi, pp.138-139, 2016. ,
DOI : 10.1109/CVPR.2015.7298735
URL : https://hal.archives-ouvertes.fr/hal-01349107
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.724-732, 2016. ,
DOI : 10.1109/CVPR.2016.85
Improving the Fisher Kernel for Large-Scale Image Classification, Proceedings of the European Conference on Computer Vision, pp.143-156, 2010. ,
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630
Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007. ,
DOI : 10.1109/CVPR.2007.383172
Comparison of human and computer performance across face recognition experiments, Image and Vision Computing, vol.32, issue.1, pp.74-85, 2014. ,
DOI : 10.1016/j.imavis.2013.12.002
Learning to Refine Object Segments, Proceedings of the European Conference on Computer Vision. xv, pp.90-102, 2016. ,
DOI : 10.5244/C.30.15
URL : https://infoscience.epfl.ch/record/224543/files/Pinheiro_ECCV_2016.pdf
Globally-optimal greedy algorithms for tracking a variable number of objects, CVPR 2011, pp.1201-1208, 2011. ,
DOI : 10.1109/CVPR.2011.5995604
Explicit Modeling of Human-Object Interactions in Realistic Videos, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.4, pp.835-848, 2013. ,
DOI : 10.1109/TPAMI.2012.175
URL : https://hal.archives-ouvertes.fr/inria-00626929
Learning object class detectors from weakly annotated video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.62-66, 2012. ,
DOI : 10.1109/CVPR.2012.6248065
URL : https://hal.archives-ouvertes.fr/hal-00695940
Weakly Supervised Learning of Interactions between Humans and Objects, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.3, pp.601-614, 2012. ,
DOI : 10.1109/TPAMI.2011.158
URL : https://hal.archives-ouvertes.fr/inria-00516477
Learning multi-view neighborhood preserving projections, International Conference on Machine Learning, pp.425-432, 2011. ,
Subspace Alignment Based Domain Adaptation for RCNN Detector, Proceedings of the British Machine Vision Conference 2015, p.39, 2015. ,
DOI : 10.5244/C.29.166
URL : http://arxiv.org/pdf/1507.05578
Building models of animals from video, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.8, pp.1319-1334, 2006. ,
DOI : 10.1109/TPAMI.2006.155
Discovering discriminative action parts from mid-level video representations, 2012 IEEE Conference on Computer Vision and Pattern Recognition, p.89, 2012. ,
DOI : 10.1109/CVPR.2012.6247807
URL : https://hal.archives-ouvertes.fr/hal-00918807
You Only Look Once: Unified, Real-Time Object Detection, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.779-788, 2016. ,
DOI : 10.1109/CVPR.2016.91
Yolo9000: Better, faster, stronger. arXiv Preprint, p.22, 2016. ,
DOI : 10.1109/cvpr.2017.690
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, Advances in Neural Information Processing Systems. xiii, pp.91-94, 2015. ,
DOI : 10.1109/TPAMI.2016.2577031
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.6, p.29, 2015. ,
DOI : 10.1109/TPAMI.2016.2577031
Object Detection Networks on Convolutional Feature Maps, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.7, p.22, 2016. ,
DOI : 10.1109/TPAMI.2016.2601099
Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, p.86, 2008. ,
DOI : 10.1109/CVPR.2008.4587727
URL : http://longwood.cs.ucf.edu/~vision/papers/cvpr2008/7.pdf
LCR-Net: Localization-classification-regression for human pose, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.23, 2017. ,
DOI : 10.1109/cvpr.2017.134
Towards model-based recognition of human movements in image sequences, CVGIP: Image Understanding, vol.59, issue.1, pp.94-115, 1994. ,
Beyond sharing weights for deep domain adaptation. arXiv Preprint, p.43, 2016. ,
ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, vol.115, issue.3, p.57, 2015. ,
DOI : 10.1007/s11263-015-0816-y
URL : http://dspace.mit.edu/bitstream/1721.1/104944/1/11263_2015_Article_816.pdf
Recognition using visual phrases, CVPR 2011, p.90, 2011. ,
DOI : 10.1109/CVPR.2011.5995711
URL : http://www.cs.rit.edu/%7Erlc/Courses/ImageUnderstanding/Papers/Current/visualPhrases.pdf
Adapting Visual Category Models to New Domains, Proceedings of the European Conference on Computer Vision, pp.213-226, 2010. ,
DOI : 10.1007/978-3-642-15561-1_16
URL : http://www1.icsi.berkeley.edu/~saenko/saenko_eccv_2010.pdf
AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture, 2017 IEEE International Conference on Computer Vision (ICCV), p.49, 2017. ,
DOI : 10.1109/ICCV.2017.473
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos, Proceedings of the British Machine Vision Conference 2016, pp.122-127, 2016. ,
DOI : 10.5244/C.30.58
Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.105, issue.3, pp.222-245, 2013. ,
DOI : 10.1007/s11263-013-0636-x
Modeling the Temporal Extent of Actions, Proceedings of the European Conference on Computer Vision, pp.536-548, 2010. ,
DOI : 10.1007/978-3-642-15549-9_39
Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.32-36, 2004. ,
DOI : 10.1109/ICPR.2004.1334462
URL : http://www.nada.kth.se/%7Ecaputo/publik/icpr04actions.pdf
Integrated recognition, localization and detection using convolutional networks, International Conference on Learning Representations, p.22 ,
Generalized Multiview Analysis: A discriminative latent space, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.2160-2167, 2012. ,
DOI : 10.1109/CVPR.2012.6247923
Unsupervised incremental learning for improved object detection in a video, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.3298-3305, 2012. ,
DOI : 10.1109/CVPR.2012.6248067
Efficient Detector Adaptation for Object Detection in a Video, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.44-56, 2013. ,
DOI : 10.1109/CVPR.2013.418
Weakly Supervised Object Localization Using Things and Stuff Transfer, 2017 IEEE International Conference on Computer Vision (ICCV), 2017. ,
DOI : 10.1109/ICCV.2017.366
Weakly Supervised Object Localization Using Size Estimates, Proceedings of the European Conference on Computer Vision, pp.105-121, 2016. ,
DOI : 10.1007/978-3-642-33786-4_5
Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation, Proceedings of the 23rd ACM international conference on Multimedia, MM '15, pp.35-44, 2015. ,
DOI : 10.1145/2647868.2654914
Two-stream convolutional networks for action recognition in videos, Advances in Neural Information Processing Systems, pp.95-105, 2014. ,
Very deep convolutional networks for large-scale image recognition, International Conference on Learning Representations, pp.95-130, 2015. ,
Online real time multiple spatiotemporal action localisation and prediction on a single platform, arXiv Preprint. xvi, pp.127-137, 2017. ,
DOI : 10.1109/iccv.2017.393
On learning to localize objects with minimal supervision, International Conference on Machine Learning, p.23, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00996849
Sliding Shapes for 3D Object Detection in Depth Images, Proceedings of the European Conference on Computer Vision, pp.634-651, 2014. ,
DOI : 10.1007/978-3-319-10599-4_41
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild, CRCV-TR-12-01. iii, xi, xvii, pp.5-6, 2012. ,
Dropout: a simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, issue.1, pp.1929-1958, 2014. ,
Crowdsourcing annotations for visual object detection, Workshops at the Twenty-Sixth AAAI Conference on Artificial Intelligence, p.148, 2012. ,
Deep CORAL: Correlation alignment for deep domain adaptation, Computer Vision – ECCV 2016 Workshops, pp.443-450, 2016. ,
Inception-v4, Inception-ResNet and the impact of residual connections on learning. arXiv Preprint, 2016. ,
Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
DOI : 10.1109/CVPR.2015.7298594
DeepFace: Closing the Gap to Human-Level Performance in Face Verification, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.1701-1708, 2014. ,
DOI : 10.1109/CVPR.2014.220
Shifting weights: Adapting object detectors from image to video, Advances in Neural Information Processing Systems, pp.42-44, 2012. ,
Discriminative Segment Annotation in Weakly Labeled Video, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.56-59, 2013. ,
DOI : 10.1109/CVPR.2013.321
URL : http://www.cs.cmu.edu/~rahuls/pub/cvpr2013-crane-rahuls.pdf
Spatiotemporal Deformable Part Models for Action Detection, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.2642-2649, 2013. ,
DOI : 10.1109/CVPR.2013.341
URL : http://www.cs.cmu.edu/~rahuls/pub/cvpr2013-sdpm-rahuls.pdf
Weakly-Supervised Semantic Segmentation Using Motion Cues, Proceedings of the European Conference on Computer Vision, pp.388-404, 2016. ,
DOI : 10.1109/TPAMI.2012.120
URL : https://hal.archives-ouvertes.fr/hal-01292794
Learning Motion Patterns in Videos, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.23, 2017. ,
DOI : 10.1109/CVPR.2017.64
URL : https://hal.archives-ouvertes.fr/hal-01427480
Learning Video Object Segmentation with Visual Memory, 2017 IEEE International Conference on Computer Vision (ICCV), 2017. ,
DOI : 10.1109/ICCV.2017.480
URL : https://hal.archives-ouvertes.fr/hal-01511145
Unbiased look at dataset bias, CVPR 2011, pp.75-161, 2011. ,
DOI : 10.1109/CVPR.2011.5995347
URL : http://people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf
Optimal spatio-temporal path discovery for video event detection, CVPR 2011, pp.3321-3328, 2011. ,
DOI : 10.1109/CVPR.2011.5995416
Max-margin structured output regression for spatiotemporal action localization, Advances in Neural Information Processing Systems, pp.350-358, 2012. ,
Simultaneous Deep Transfer Across Domains and Tasks, 2015 IEEE International Conference on Computer Vision (ICCV), pp.4068-4076, 2015. ,
DOI : 10.1109/ICCV.2015.463
Adversarial discriminative domain adaptation. arXiv preprint, pp.42-113, 2017. ,
DOI : 10.1109/cvpr.2017.316
Deep domain confusion: Maximizing for domain invariance. arXiv Preprint, p.40, 2014. ,
Selective Search for Object Recognition, International Journal of Computer Vision, vol.104, issue.2, pp.27-47, 2013. ,
DOI : 10.1007/s11263-013-0620-5
Visualizing data using t-SNE, p.76, 2008. ,
Sequence to Sequence -- Video to Text, 2015 IEEE International Conference on Computer Vision (ICCV), p.87, 2015. ,
DOI : 10.1109/ICCV.2015.515
URL : http://arxiv.org/pdf/1505.00487
Object localization in ImageNet by looking out of the window, Proceedings of the British Machine Vision Conference 2015, p.23, 2015. ,
DOI : 10.5244/C.29.27
Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.1096-1103, 2008. ,
DOI : 10.1145/1390156.1390294
Show and tell: A neural image caption generator, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.90, 2015. ,
DOI : 10.1109/CVPR.2015.7298935
URL : http://arxiv.org/pdf/1411.4555
Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, p.89, 2001. ,
DOI : 10.1109/CVPR.2001.990517
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, vol.13, issue.2, pp.260-269, 1967. ,
DOI : 10.1109/TIT.1967.1054010
Semantic segmentation of urban scenes by learning local class interactions, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.1-9, 2015. ,
DOI : 10.1109/CVPRW.2015.7301377
Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition, IEEE Transactions on Multimedia, vol.17, issue.11, pp.1887-1898, 2015. ,
DOI : 10.1109/TMM.2015.2476655
Dense Trajectories and Motion Boundary Descriptors for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, pp.60-79, 2013. ,
DOI : 10.1007/s11263-012-0594-8
URL : https://hal.archives-ouvertes.fr/hal-00725627
A Robust and Efficient Video Representation for Action Recognition, International Journal of Computer Vision, vol.103, issue.1, p.89, 2015. ,
DOI : 10.1109/ICCV.2013.442
URL : https://hal.archives-ouvertes.fr/hal-01145834
Video Action Detection with Relational Dynamic-Poselets, Proceedings of the European Conference on Computer Vision, pp.565-580, 2014. ,
DOI : 10.1007/978-3-319-10602-1_37
Actionness Estimation Using Hybrid Fully Convolutional Networks, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.142, 2016. ,
DOI : 10.1109/CVPR.2016.296
Unsupervised Learning of Visual Representations Using Videos, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2794-2802, 2015. ,
DOI : 10.1109/ICCV.2015.320
Detection by detections: Non-parametric detector adaptation for a video, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.350-357, 2012. ,
Learning to track for spatiotemporal action localization, Proceedings of the International Conference on Computer Vision. xiii, pp.127-129, 2015. ,
DOI : 10.1109/iccv.2015.362
URL : https://hal.archives-ouvertes.fr/hal-01159941
Human action localization with sparse spatial supervision, p.150, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01317558
Evaluation of super-voxel methods for early video processing, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1202-1209, 2012. ,
Actor-action semantic segmentation with grouping process models, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, p.90, 2016. ,
DOI : 10.1109/cvpr.2016.336
URL : http://arxiv.org/pdf/1512.09041
Can humans fly? Action understanding with multiple classes of actors, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.88-90, 2015. ,
DOI : 10.1109/CVPR.2015.7298839
Incremental Domain Adaptation of Deformable Part-based Models, Proceedings of the British Machine Vision Conference 2014, pp.2367-2380, 2014. ,
DOI : 10.5244/C.28.120
Recognizing human action in time-sequential images using hidden Markov model, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.379-385, 1992. ,
DOI : 10.1109/cvpr.1992.223161
Adapting SVM Classifiers to Data with Shifted Distributions, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007), pp.69-76, 2007. ,
DOI : 10.1109/ICDMW.2007.37
URL : http://repository.cmu.edu/cgi/viewcontent.cgi?article=1943&context=compsci
Cross-domain video concept detection using adaptive SVMs, Proceedings of the 15th international conference on Multimedia, MULTIMEDIA '07, pp.188-197, 2007. ,
DOI : 10.1145/1291233.1291276
Human action recognition by learning bases of action attributes and parts, 2011 International Conference on Computer Vision, p.90, 2011. ,
DOI : 10.1109/ICCV.2011.6126386
Describing Videos by Exploiting Temporal Structure, 2015 IEEE International Conference on Computer Vision (ICCV), p.87, 2015. ,
DOI : 10.1109/ICCV.2015.512
AttentionNet: Aggregating Weak Directions for Accurate Object Detection, 2015 IEEE International Conference on Computer Vision (ICCV), pp.2659-2667, 2015. ,
DOI : 10.1109/ICCV.2015.305
URL : http://arxiv.org/pdf/1506.07704
Fast action proposals for human action detection and search, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.121, 2015. ,
DOI : 10.1109/CVPR.2015.7298735
Discriminative subvolume search for efficient action detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.2442-2449, 2009. ,
Visualizing and Understanding Convolutional Networks, Proceedings of the European Conference on Computer Vision, pp.818-833, 2014. ,
DOI : 10.1007/978-3-319-10590-1_53
URL : http://cs.nyu.edu/%7Efergus/papers/zeilerECCV2014.pdf
Summary Transfer: Exemplar-Based Subset Selection for Video Summarization, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1059-1067, 2016. ,
DOI : 10.1109/CVPR.2016.120
URL : http://arxiv.org/pdf/1603.03369
Flow-guided feature aggregation for video object detection. arXiv Preprint, p.51, 2017. ,
DOI : 10.1109/iccv.2017.52
Deep Feature Flow for Video Recognition, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), p.50, 2017. ,
DOI : 10.1109/CVPR.2017.441
Edge Boxes: Locating Object Proposals from Edges, Proceedings of the European Conference on Computer Vision, pp.391-405, 2014. ,
DOI : 10.1007/978-3-319-10602-1_26
URL : http://research.microsoft.com/en-us/um/people/larryz/ZitnickDollarECCV14edgeBoxes.pdf
Chained multistream networks exploiting pose, motion, and appearance for action classification and detection, p.148, 2017. ,
DOI : 10.1109/iccv.2017.316