,
Incremental Learning of Object Detectors without Catastrophic Forgetting, p.2017 ,
URL : https://hal.archives-ouvertes.fr/hal-01573623
Weakly-Supervised Semantic Segmentation using Motion Cues, ECCV, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01292794
Discriminative Actions for Recognising Events, ICVGIP, 2006. ,
, Dynamic Events as Mixtures of Spatial and Temporal Features, ICVGIP, 2006.
Dynamic Hybrid Algorithms for MAP Inference in Discrete MRFs, Trans. PAMI, vol.32, issue.10, pp.1846-1857, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01216727
, Reduce, Reuse & Recycle: Efficiently Solving Multi-Label MRFs, 2008.
Geometric and Stochastic Error Minimisation in Motion Tracking, 2004. ,
Discriminant Substrokes for Online Handwriting Recognition, ICDAR, 2005. ,
, Learning Mixtures of Offline and Online features for Handwritten Stroke Recognition, ICPR, 2006.
Pose Estimation and Segmentation of People in 3D Movies, ICCV, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00874884
Endto-End Incremental Learning, ECCV, 2018. ,
Mixing Body-Part Sequences for Human Pose Estimation, CVPR, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00978643
Learning Graphs to Match, ICCV, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00875105
Occlusion and Motion Reasoning for Longterm Tracking, ECCV, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01020149
, Online Object Tracking with Proposal Selection, ICCV, 2015.
What, Where & How Many? Combining Object Detectors and CRFs, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01216730
Track to the Future: Spatiotemporal Video Segmentation with Long-range Motion Cues, CVPR, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00817961
Enhancing Energy Minimization Framework for Scene Text Recognition with Top-Down Cues, vol.145, pp.30-42, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01263322
, Image Retrieval using Textual Cues, ICCV, 2013.
, Scene Text Recognition using Higher Order Language Priors, BMVC, 2012.
, Top-Down and Bottom-Up Cues for Scene Text Recognition, CVPR, 2012.
Exact Inference in Multi-label CRFs with Higher Order Cliques, CVPR, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01217304
Scene Text Recognition and Retrieval for Large Lexicons, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01088739
Recognizing Human Activities from Constituent Actions, National Conf. Communications, 2005. ,
Pose Estimation and Segmentation of Multiple People in Stereoscopic Movies, Trans. PAMI, vol.37, issue.8, pp.1643-1655, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01089660
How good is my GAN?, ECCV, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01850447
, Incremental Learning of Object Detectors without Catastrophic Forgetting, ICCV, 2017.
Combining Appearance and Structure from Motion Features for Road Scene Understanding, BMVC, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01216879
Learning Motion Patterns in Videos, CVPR, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01427480
, Learning Video Object Segmentation with Visual Memory, 2017.
, Weakly-Supervised Semantic Segmentation using Motion Cues, ECCV, 2016.
Learning to Segment Moving Objects, IJCV, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01653720
, ICDAR 2003 datasets
, Street View Text dataset
On seeing stuff: the perception of materials by humans and machines, Proc. SPIE, vol.4299, pp.1-12, 2001. ,
People-tracking-by-detection and peopledetection-by-tracking, CVPR, 2008. ,
, Pictorial Structures Revisited: People Detection and Articulated Pose Estimation, CVPR, 2009.
Objects that Sound, ECCV, 2018. ,
Contour Detection and Hierarchical Image Segmentation, Trans. PAMI, vol.33, issue.5, pp.898-916, 2011. ,
Multiscale Combinatorial Grouping, CVPR, 2014. ,
Ensemble Tracking, Trans. PAMI, vol.29, issue.2, pp.261-271, 2007. ,
Robust Object Tracking with Online Multiple Instance Learning, Trans. PAMI, vol.33, issue.8, pp.1619-1632, 2011. ,
Diverse M-best solutions in Markov random fields, ECCV, 2012. ,
Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks, CVPR, 2016. ,
Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition, Trans. PAMI, vol.33, issue.10, pp.2066-2080, 2011. ,
It's Moving! A Probabilistic Model for Causal Motion Segmentation in Moving Camera Videos, ECCV, 2016. ,
PhotoOCR: Reading Text in Uncontrolled Conditions, ICCV, 2013. ,
Interactive Image Segmentation Using an Adaptive GMMRF Model, ECCV, 2004. ,
Pseudo-Boolean optimization, Discrete Applied Mathematics, 2002. ,
URL : https://hal.archives-ouvertes.fr/hal-01150533
Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images, ICCV, 2001. ,
Fast Approximate Energy Minimization via Graph Cuts, Trans. PAMI, vol.23, issue.11, pp.1222-1239, 2001. ,
Video object segmentation by tracking regions, ICCV, 2009. ,
Object Segmentation by Long Term Analysis of Point Trajectories, ECCV, 2010. ,
Large displacement optical flow: Descriptor matching in variational motion estimation, Trans. PAMI, vol.33, issue.3, pp.510-513, 2011. ,
One-Shot Video Segmentation, 2017. ,
Incremental and Decremental Support Vector Machine Learning, NIPS, 2000. ,
Semantic image segmentation with deep convolutional nets and fully connected CRFs, 2015. ,
, DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs, Trans. PAMI, vol.40, issue.4, pp.834-848, 2018.
NEIL: Extracting Visual Knowledge from Web Data, ICCV, 2013. ,
Mean shift: A robust approach toward feature space analysis, Trans. PAMI, 2002. ,
Histograms of Oriented Gradients for Human Detection, CVPR, 2005. ,
URL : https://hal.archives-ouvertes.fr/inria-00548512
Accurate Scale Estimation for Robust Visual Tracking, BMVC, 2014. ,
Fast Approximate Energy Minimization with Label Costs, CVPR, 2010. ,
Spatio-temporal segmentation of video by hierarchical mean shift analysis, Statistical Methods in Video Processing Workshop, 2002. ,
Discriminative Models for Multi-class Object Layout, ICCV, 2009. ,
Learning Everything about Anything: Webly-Supervised Visual Concept Learning, CVPR, 2014. ,
Structured Forests for Fast Edge Detection, ICCV, 2013. ,
FlowNet: Learning Optical Flow with Convolutional Networks, ICCV, 2015. ,
2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images, IJCV, vol.99, issue.2, pp.190-214, 2012. ,
A comprehensive neural-based approach for text recognition in videos using natural language processing, ICMR, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00645219
Category-Independent Object Proposals with Diverse Ranking, Trans. PAMI, vol.36, issue.2, pp.222-234, 2014. ,
Detecting Text in Natural Scenes with Stroke Width Transform, CVPR, 2010. ,
The Pascal Visual Object Classes Challenge: A Retrospective, IJCV, vol.111, issue.1, pp.98-136, 2015. ,
Hello! My name is ,
Automatic naming of characters in TV video, BMVC, 2006. ,
Video Segmentation by Non-Local Consensus Voting, BMVC, 2014. ,
Learning hierarchical features for scene labeling, Trans. PAMI, vol.35, issue.8, pp.1915-1929, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00742077
Improving Open-Vocabulary Scene Text Recognition, ICDAR, 2013. ,
Efficient Graph-Based Image Segmentation, IJCV, vol.59, issue.2, pp.167-181, 2004. ,
Object Detection with Discriminatively Trained Part Based Models, Trans. PAMI, vol.32, issue.9, pp.1627-1645, 2010. ,
Pictorial structures for object recognition, IJCV, vol.61, issue.1, pp.55-79, 2005. ,
Distance Transforms of Sampled Functions, Theory of Computing, vol.8, 2012. ,
Progressive search space reduction for human pose estimation, CVPR, 2008. ,
The representation and matching of pictorial structures, IEEE Trans. Computers, vol.100, issue.1, pp.67-92, 1973. ,
Pose from Flow and Flow from Pose, CVPR, 2013. ,
TURN TAP: Temporal Unit Regression Networks for Temporal Action Proposals, 2017. ,
Fast R-CNN, ICCV, 2015. ,
Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR, 2014. ,
Scene Text Recognition: No Country for Old Men?, ACCV Workshops, 2014. ,
Generative adversarial nets, NIPS, 2014. ,
Hybrid speech recognition with deep bidirectional LSTM, Workshop on Automatic Speech Recognition and Understanding, 2013. ,
, Hybrid computing using a neural network with dynamic external memory, 2016.
Efficient Hierarchical GraphBased Video Segmentation, CVPR, 2010. ,
Humanising GrabCut: Learning to segment humans using the Kinect, ICCV Workshop Consumer Depth Cameras for Computer Vision, 2011. ,
BranchOut: Regularization for Online Ensemble Tracking with CNNs, 2017. ,
Struck: Structured output tracking with kernels, ICCV, 2011. ,
Multiple View Geometry in Computer Vision, 2004. ,
Deep residual learning for image recognition, CVPR, 2016. ,
High-Speed Tracking with Kernelized Correlation Filters, Trans. PAMI, vol.37, issue.3, pp.583-596, 2015. ,
Distilling the knowledge in a neural network, NIPS, 2014. ,
Long Short-term Memory, Neural Computation, vol.9, issue.8, pp.1735-1780, 1997. ,
Weakly Supervised Semantic Segmentation using Web-Crawled Videos, 2017. ,
Method and means for recognizing complex patterns, US Patent 3,069,654, 1962. ,
Flownet 2.0: Evolution of optical flow estimation with deep networks, 2017. ,
Reading Text in the Wild with Convolutional Neural Networks, IJCV, vol.116, issue.1, pp.1-20, 2016. ,
Deep Features for Text Spotting, ECCV, 2014. ,
Fusionseg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos, 2017. ,
Learning to Predict Where Humans Look, ICCV, 2009. ,
Tracking-Learning-Detection, Trans. PAMI, vol.34, issue.7, pp.1409-1422, 2012. ,
A New Approach to Linear Filtering and Prediction Problems, J. Fluids Engineering, 1960. ,
, ICDAR 2013 Robust Reading Competition, 2013.
The Benefits of Dense Stereo for Pedestrian Detection, IEEE Trans. Intell. Transp. Syst, vol.12, issue.4, pp.1096-1106, 2011. ,
Motion trajectory segmentation via minimum cost multicuts, ICCV, 2015. ,
Classifier Based Graph Construction for Video Segmentation, CVPR, 2015. ,
Learning Video Object Segmentation from Static Images, 2017. ,
Overcoming catastrophic forgetting in neural networks, 2017. ,
Primary Object Segmentation in Videos Based on Region Augmentation and Reduction, 2017. ,
Bi-layer segmentation of binocular stereo video, CVPR, 2005. ,
What energy functions can be minimized via graph cuts, Trans. PAMI, vol.26, issue.2, pp.147-159, 2004. ,
Convergent Tree-Reweighted Message Passing for Energy Minimization, Trans. PAMI, vol.28, issue.10, pp.1568-1583, 2006. ,
MRF Optimization via Dual Decomposition: Message-Passing Revisited, ICCV, 2007. ,
A viewercentric editor for 3D movies, Computer Graphics and Applications, vol.31, issue.1, pp.20-35, 2011. ,
Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials, NIPS, 2011. ,
ImageNet classification with deep convolutional neural networks, NIPS, 2012. ,
NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images, 2013. ,
Learning Layered Motion Segmentations of Video, ICCV, 2005. ,
Human Pose Estimation using a Joint Pixel-wise and Part-wise Formulation, CVPR, 2013. ,
Conditional Random Fields: Probabilistic models for segmenting and labelling sequence data, ICML, 2001. ,
Human pose tracking in monocular sequence using multilevel structured models, Trans. PAMI, vol.31, issue.1, pp.27-38, 2009. ,
Key-segments for video object segmentation, ICCV, 2011. ,
Learning without forgetting, ECCV, 2016. ,
Microsoft COCO: Common objects in context, 2014. ,
Beyond Pixels: Exploring New Representations and Applications for Motion Analysis, 2009. ,
Fully convolutional networks for semantic segmentation, CVPR, 2015. ,
Matching theory, 1986. ,
The Visual Object Tracking VOT2014 challenge results, ECCV Visual Object Tracking Challenge Workshop, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01301090
Actions in Context, CVPR, 2009. ,
The Template Update Problem, Trans. PAMI, vol.26, issue.6, pp.810-815, 2004. ,
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation, CVPR, 2016. ,
Catastrophic interference in connectionist networks: The sequential learning problem, Psychology of learning and motivation, vol.24, pp.109-165, 1989. ,
Distance-Based Image Classification: Generalizing to New Classes at Near-Zero Cost, Trans. PAMI, vol.35, issue.11, pp.2624-2637, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00817211
Beyond Bounding-Boxes: Learning Object Shape by Model-Driven Grouping, ECCV, 2012. ,
Feedforward Semantic Segmentation With Zoom-Out Features, CVPR, 2015. ,
Loopy belief propagation for approximate inference: An empirical study, 1999. ,
Twenty Years of Document Image Analysis in PAMI, Trans. PAMI, vol.22, issue.1, pp.38-62, 2000. ,
Coherent Motion Segmentation in Moving Camera Videos Using Optical Flow Orientations, ICCV, 2013. ,
Consensus-based matching and tracking of keypoints for object tracking, 2014. ,
A Method for Text Localization and Recognition in Real-World Images, 2010. ,
, A real-time scene text to speech system, ECCV workshops, 2012.
, On Combining Multiple Segmentations in Scene Text Recognition, ICDAR, 2013.
, Real-time scene text localization and recognition, CVPR, 2012.
Efficient Extraction of Human Motion Volumes by Tracking, CVPR, 2010. ,
Large-Lexicon AttributeConsistent Text Recognition in Natural Images, ECCV, 2012. ,
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features, ECCV, 2018. ,
PersonLab: Person Pose Estimation and Instance Segmentation with a Part-Based Geometric Embedding Model, ECCV, 2018. ,
Towards Accurate Multi-person Pose Estimation in the Wild, 2017. ,
Weakly-and semisupervised learning of a DCNN for semantic image segmentation, ICCV, 2015. ,
Fast object segmentation in unconstrained video, ICCV, 2013. ,
N-best maximal decoders for part models, ICCV, 2011. ,
Constrained Convolutional Neural Networks for Weakly Supervised Segmentation, ICCV, 2015. ,
Fully convolutional multi-class multiple instance learning, ICLR, 2015. ,
Probabilistic Reasoning in Intelligent Systems : Networks of Plausible Inference, 1988. ,
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation, CVPR, 2016. ,
Learning to refine object segments, ECCV, 2016. ,
From Image-level to Pixel-level Labeling with Convolutional Networks, CVPR, 2015. ,
Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods, Advances in Large Margin Classifiers, 1999. ,
Learn++: An incremental learning algorithm for supervised neural networks, IEEE Trans. Systems, Man, and Cybernetics, Part C, vol.31, issue.4, pp.497-508, 2001. ,
Learning object class detectors from weakly annotated video, CVPR, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00695940
Deep context: End-to-end contextual speech recognition, IEEE Spoken Lang. Tech, 2018. ,
Strike a pose: Tracking people by finding stylized poses, CVPR, 2005. ,
Connectionist models of recognition memory: constraints imposed by learning and forgetting functions, Psychological review, vol.97, issue.2, p.285, 1990. ,
, iCaRL: Incremental Classifier and Representation Learning, 2017.
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, NIPS, 2015. ,
RGB-(D) Scene Labeling: Features and Algorithms, CVPR, 2012. ,
EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow, CVPR, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01142656
A Contextual Postprocessing System for Error Correction Using Binary n-Grams, IEEE Trans. Comput, pp.480-493, 1974. ,
Incremental Learning of NCM Forests for Large-Scale Image Classification, CVPR, 2014. ,
A database for fine grained activity detection of cooking activities, CVPR, 2012. ,
U-Net: Convolutional Networks for Biomedical Image Segmentation, 2015. ,
Grabcut: Interactive foreground extraction using iterated graph cuts, ACM Trans. Graphics, vol.23, issue.3, pp.309-314, 2004. ,
Using Multiple Segmentations to Discover Objects and their Extent in Image Collections, CVPR, 2006. ,
Exact and Approximate Inference in Associative Hierarchical Networks using Graph Cuts, 2010. ,
Particle Video: Long-Range Motion Estimation Using Point Trajectories, IJCV, vol.80, issue.1, 2008. ,
Cascaded models for articulated pose estimation, ECCV, 2010. ,
Parsing human motion with stretchable models, CVPR, 2011. ,
A Case Study of Incremental Concept Induction, 1986. ,
ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images, ICDAR, 2011. ,
A Robust Stereo Prior for Human Segmentation, 2012. ,
Scene Text Recognition Using Part-Based Tree-Structured Character Detection, CVPR, 2013. ,
Motion segmentation and tracking using normalized cuts, ICCV, 1998. ,
, Normalized Cuts and Image Segmentation, Trans. PAMI, vol.22, issue.8, pp.888-905, 2000.
Real-time human pose recognition in parts from single depth images, CVPR, 2011. ,
TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context, IJCV, vol.81, issue.1, pp.2-23, 2009. ,
Stochastic Tracking of 3D Human Figures Using 2D Image Motion, ECCV, 2000. ,
Two-Stream Convolutional Networks for Action Recognition in Videos, NIPS, 2014. ,
, Very Deep Convolutional Networks for Large-Scale Image Recognition, ICLR, 2015.
Estimating articulated human motion with covariance scaled sampling, IJRR, vol.22, issue.6, pp.371-391, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00548242
Variational mixture smoothing for non-linear dynamical systems, CVPR, 2004. ,
Learning to Extract Object Boundaries using Motion Cues, ICCV, 2007. ,
Actor-centric Relation Network, ECCV, 2018. ,
Dense point trajectories by GPUaccelerated large displacement optical flow, ECCV, 2010. ,
Self-Paced Learning for Long-Term Tracking, CVPR, 2013. ,
An embedded application for degraded text recognition, EURASIP J. Applied Signal Processing, pp.2127-2135, 2005. ,
Is learning the n-th thing any easier than learning the first?, NIPS, 1996. ,
Detection and tracking of point features, 1991. ,
Multiple Hypothesis Video Segmentation from Superpixel Flows, ECCV, 2010. ,
Efficient Additive Kernels via Explicit Feature Maps, Trans. PAMI, vol.34, issue.3, pp.480-492, 2012. ,
Weakly supervised structured output learning for semantic segmentation, CVPR, 2012. ,
Rapid object detection using a boosted cascade of simple features, CVPR, 2001. ,
Dense trajectories and motion boundary descriptors for action recognition, IJCV, vol.103, issue.1, pp.60-79, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00725627
Gaussian process dynamical models for human motion, Trans. PAMI, 2008. ,
End-to-End Scene Text recognition, ICCV, 2011. ,
Toward Integrated Scene Text Reading, Trans. PAMI, vol.36, issue.2, pp.375-387, 2014. ,
Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation, Trans. PAMI, vol.31, issue.10, pp.1733-1746, 2009. ,
Learning to Detect Motion Boundaries, CVPR, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01142653
Sidestepping intractable inference with structured ensemble cascades, NIPS, 2010. ,
Smoothness in layers: Motion segmentation using nonparametric mixture estimation, CVPR, 1997. ,
Correctness of local probability propagation in graphical models with loops, Neural computation, 2000. ,
Herding Dynamical Weights to Learn, ICML, 2009. ,
Backpropagation through time: What it does and how to do it, Proc. IEEE, vol.78, pp.1550-1560, 1990. ,
Memory Networks, ICLR, 2015. ,
MILCut: A Sweeping Line Multiple Instance Learning Paradigm for Interactive Image Segmentation, CVPR, 2014. ,
Online Object Tracking: A Benchmark, CVPR, 2013. ,
Dynamic Memory Networks for Visual and Textual Question Answering, ICML, 2016. ,
Multi-Graph Matching via Affinity Optimization with Graduated Consistency Regularization, Trans. PAMI, 2016. ,
Layered Object Models for Image Segmentation, Trans. PAMI, vol.34, issue.9, pp.1731-1743, 2011. ,
Articulated Human Detection with Flexible Mixturesof-Parts, Trans. PAMI, 2012. ,
, Articulated Pose Estimation using Flexible Mixtures of Parts, CVPR, 2011.
Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities, CVPR, 2010. ,
Strokelets: A Learned Multi-scale Representation for Scene Text Recognition, CVPR, 2014. ,
Text Detection and Recognition in Imagery: A survey, Trans. PAMI, vol.37, issue.7, pp.1480-1500, 2015. ,
The Sound of Pixels, ECCV, 2018. ,
, Conditional Random Fields as Recurrent Neural Networks, 2015.
Edge Boxes: Locating Object Proposals from Edges, ECCV, 2014. ,
Consistent segmentation for optical flow estimation, ICCV, 2005. ,