Human motion analysis: A review, Computer Vision and Image Understanding, vol.73, issue.3, p.30, 1999. ,
Pose-conditioned joint angle limits for 3D human pose reconstruction, CVPR, p.22, 2015. ,
Optical flow-based 3D human motion estimation from monocular video, GCPR, p.26, 2017. ,
Shape quantization and recognition with randomized trees, Neural Computation, vol.9, issue.7, p.19, 1997. ,
Pictorial structures revisited: People detection and articulated pose estimation, CVPR, p.19, 2009. ,
Discriminative appearance models for pictorial structures, International Journal of Computer Vision, vol.99, p.19, 2012. ,
2D human pose estimation: New benchmark and state of the art analysis, CVPR, vol.71, p.74, 2014. ,
Learning Models of Shape from 3D Range Data, p.27, 2005. ,
SCAPE: Shape completion and animation of people, SIGGRAPH, vol.26, p.65, 2005. ,
A human body modelling system for motion studies, IEEE, vol.11, p.18, 1979. ,
Temporal Scene Analysis: Conceptual Descriptions of Object Movements, vol.29, p.30, 1975. ,
Detailed human shape and pose from images, CVPR, vol.26, p.64, 2007. ,
Motion capture of hands in action using discriminative salient points, ECCV, p.176, 2012. ,
Delving deeper into convolutional networks for learning video representations, ICLR, p.35, 2016. ,
Pose-conditioned spatio-temporal attention for human action recognition. CoRR, abs/1703.10106, vol.125, p.139, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01593548
Glimpse clouds: Human activity recognition from unstructured feature points, CVPR, vol.141, p.143, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01713109
Looking beyond appearances: Synthetic training data for deep CNNs in re-identification, Computer Vision and Image Understanding, vol.167, p.74, 2018. ,
Efficient method for contour tracking using active shape models, Motion of Non-Rigid and Articulated Obgects Workshop, p.31, 1994. ,
Generating spatiotemporal models from examples, Image and Vision Computing, vol.14, p.31, 1996. ,
Dynamic image networks for action recognition, CVPR, vol.35, p.100, 2016. ,
Learning parameterized models of image motion, CVPR, p.32, 1997. ,
, Blender -a 3D modelling and rendering package, vol.42, p.185
Movement, activity and action: the role of knowledge in the perception of motion, Philosophical transactions of the Royal Society of London. Series B, Biological sciences, vol.352, pp.1257-65, 1997. ,
, , p.87
Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image, ECCV, vol.64, p.80, 2016. ,
Learning and recognizing human dynamics in video sequences, CVPR, p.32, 1997. ,
Tracking people with twists and exponential maps, CVPR, p.31, 1998. ,
Signature verification using a "Siamese" time delay neural network, NIPS, p.127, 1993. ,
High accuracy optical flow estimation based on a theory for warping, ECCV, vol.101, p.103, 2004. ,
A naturalistic open source movie for optical flow evaluation, ECCV, p.74, 2012. ,
Activitynet: A large-scale video benchmark for human activity understanding, CVPR, vol.154, p.155, 2015. ,
Weakly-supervised 3D hand pose estimation from monocular RGB images, ECCV, vol.196, p.197, 2018. ,
Pose-robust face recognition via deep residual equivariant mapping, CVPR, vol.127, p.134, 2018. ,
Realtime multi-person 2D pose estimation using part affinity fields, CVPR, vol.21, p.62, 2017. ,
, , vol.43, p.184
Quo vadis, action recognition? A new model and the Kinetics dataset, CVPR, vol.35, p.128, 2017. ,
Human pose estimation with iterative error feedback, CVPR, p.20, 2016. ,
Motion-based recognition a survey, Image and Vision Computing, vol.13, issue.2, p.29, 1995. ,
, An information-rich 3D model repository, vol.176, p.184, 2015.
Automatic and efficient human pose estimation for sign language videos, International Journal of Computer Vision, p.19, 2013. ,
3D human pose estimation = 2D pose estimation + matching, CVPR, p.25, 2017. ,
Collecting highly parallel data for paraphrase evaluation, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol.1, p.154, 2011. ,
Semantic image segmentation with deep convolutional nets and fully connected CRFs, ICLR, p.24, 2015. ,
Attention to scale: Scale-aware semantic image segmentation, CVPR, vol.25, p.46, 2016. ,
Synthesizing training images for boosting human 3D pose estimation, vol.3, p.74, 2016. ,
Single-image depth perception in the wild, NIPS, vol.25, p.47, 2016. ,
Microsoft COCO captions: Data collection and evaluation server, p.167, 2015. ,
Articulated pose estimation by a graphical model with image dependent pairwise relations, NIPS, p.20, 2014. ,
Detect what you can: Detecting and representing objects using holistic models and body parts, CVPR, vol.24, p.25, 2014. ,
3D-R2N2: A unified approach for single and multi-view 3D object reconstruction, ECCV, 0197. ,
,
Bullet real-time physics simulation, 2013. ,
Visual categorization with bags of keypoints, p.33, 2004. ,
Body part detectors trained using 3d human pose annotations, ICCV, p.19, 2009. ,
Human detection using oriented histograms of flow and appearance, ECCV, p.34, 2006. ,
URL : https://hal.archives-ouvertes.fr/inria-00548587
Model-based 3D hand pose estimation from monocular video, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.9, p.173, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00856313
Global context aware local features for robust 3D point matching, CVPR, p.63, 2018. ,
ImageNet: A large-scale hierarchical image database, CVPR, vol.97, p.151, 2009. ,
Articulated body motion capture by annealed particle filtering, CVPR, p.31, 2000. ,
Exploring nearest neighbor approaches for image captioning, p.168, 2015. ,
HS-Nets: Estimating human body shape from silhouettes with convolutional neural networks, vol.3, p.28, 2016. ,
Monocular RGB hand pose inference from unsupervised refinable nets, CVPR Workshops, vol.173, p.175, 2018. ,
Long-term recurrent convolutional networks for visual recognition and description, CVPR, vol.34, p.100, 2015. ,
Learning optical flow with convolutional networks, ICCV, p.39, 2015. ,
Marker-less 3D human motion capture with monocular image sequence and height-maps, ECCV, p.40, 2016. ,
Learning actionable representations from visual observations, In IROS, p.127, 2018. ,
Recognizing action at a distance, ICCV, p.32, 2003. ,
2D articulated human pose estimation and retrieval in (almost) unconstrained still images, International Journal of Computer Vision, vol.99, issue.2, p.19, 2012. ,
Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture, ICCV, vol.25, p.46, 2015. ,
Depth map prediction from a single image using a multi-scale deep network, NIPS, vol.25, p.48, 2014. ,
, The PASCAL Visual Object Classes Challenge 2010 (VOC2010) Results
Learning to be a depth camera for close-range human capture and interaction, SIGGRAPH, p.39, 2014. ,
Learning hierarchical features for scene labeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, p.24, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00742077
Learning to recognize activities from the wrong view point, ECCV, p.125, 2008. ,
Two-frame motion estimation based on polynomial expansion, SCIA, vol.101, p.103, 2003. ,
Convolutional two-stream network fusion for video action recognition, CVPR, vol.35, p.122, 2016. ,
The grasp taxonomy of human grasp types. Human-Machine Systems, IEEE Transactions on, p.203, 2016. ,
A discriminatively trained, multiscale, deformable part model, CVPR, p.19, 2008. ,
Object detection with discriminatively trained part based models. Pattern Analysis and Machine Intelligence, vol.32, p.19, 2010. ,
Pictorial structures for object recognition, International Journal of Computer Vision, vol.61, p.18, 2005. ,
Determining the best suited semantic events for cognitive surveillance, Expert Systems with Applications, vol.38, issue.4, pp.4068-4079, 2011. ,
Modeling video evolution for action recognition, CVPR, p.34, 2015. ,
Planning optimal grasps, ICRA, p.203, 1992. ,
Progressive search space reduction for human pose estimation, CVPR, p.19, 2008. ,
2D human pose estimation in TV shows, Statistical and Geometrical Approaches to Visual Motion Analysis, p.154, 2009. ,
The representation and matching of pictorial structures, IEEE Transactions on Computers, C, vol.22, issue.1, p.18, 1973. ,
Body plans, CVPR, vol.18, 1997. ,
From lifestyle VLOGs to everyday interactions, In CVPR, issue.8, 2018. ,
Virtual worlds as proxy for multiobject tracking analysis, CVPR, p.40, 2016. ,
First-person hand action benchmark with RGB-D videos and 3D hand pose annotations, CVPR, vol.173, p.195, 2018. ,
Towards 3-D model-based tracking and recognition of human movement: a multi-view approach, Int. Workshop on Face and Gesture Recognition, p.31, 1995. ,
The visual analysis of human movement: A survey. Computer Vision and Image Understanding, vol.73, p.29, 1999. ,
Vision meets robotics: The KITTI dataset, International Journal of Robotics Research, vol.32, p.25, 2013. ,
Learning camera viewpoint using cnn to improve 3D body pose estimation, vol.3, p.74, 2016. ,
Learning a predictable and generative vector representation for objects, ECCV, p.63, 2016. ,
Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR, vol.97, p.99, 2014. ,
Using k-poselets for detecting people and localizing their keypoints, CVPR, p.19, 2014. ,
The Columbia grasp database, ICRA, vol.183, p.203, 2009. ,
Look into person: Self-supervised structure-sensitive learning and a new benchmark for human parsing, CVPR, p.25, 2017. ,
Human Sequence Evaluation: The Key-frame Approach, 2004. ,
Action recognition with a large number of classes, vol.151, p.155, 2015. ,
Actions as spacetime shapes, Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.12, p.152, 2007. ,
Spherical harmonic lighting: The gritty details, Archives of the Game Developers Conference, vol.56, p.44, 2003. ,
AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation, CVPR, vol.63, 0198. ,
, 3D correspondences by deep deformation, vol.179, p.199, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01830474
AVA: A video dataset of spatio-temporally localized atomic visual actions, CVPR, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01764300
Estimating human shape and pose from a single image, ICCV, vol.26, p.64, 2009. ,
DenseReg: Fully convolutional dense shape regression in-the-wild, CVPR, p.66, 2017. ,
DensePose: Dense human pose estimation in the wild, CVPR, vol.25, p.81, 2018. ,
3D pose from motion for cross-view action recognition via non-linear circulant temporal encoding, CVPR, p.138, 2014. ,
Objects in action: An approach for combining action understanding and object perception, CVPR, p.154, 2007. ,
Tracking a hand manipulating an object, ICCV, vol.172, p.176, 2009. ,
An object-dependent hand pose prior from sparse training data, CVPR, p.176, 2010. ,
Can spatiotemporal 3D CNNs retrace the history of 2D CNNs and ImageNet? In CVPR, vol.35, p.141, 2018. ,
Simultaneous detection and segmentation, ECCV, p.24, 2014. ,
Hypercolumns for object segmentation and fine-grained localization, CVPR, p.24, 2015. ,
Learning joint reconstruction of hands and manipulated objects, CVPR, vol.11, p.13, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02429093
Deep residual learning for image recognition, CVPR, vol.130, p.202, 2015. ,
Towards 3D hand tracking using a deformable model, International Conference on Automatic Face and Gesture Recognition, vol.172, p.175, 1996. ,
Using relaxation to find a puppet, Artificial Intelligence and Simulation of Behaviour, vol.18, p.20, 1976. ,
Long short-term memory, Neural Computation, vol.9, issue.8, p.34, 1997. ,
Model-based vision: a program to see a walking person, Image and Vision Computing, vol.1, issue.1, p.19, 1983. ,
Multi-agent event recognition, ICCV, 2001. ,
Jointly learning heterogeneous features for RGB-D activity recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.39, issue.11, p.141, 2017. ,
Towards accurate marker-less human shape and pose estimation over time, vol.3, p.26, 2017. ,
Deeper-Cut: A deeper, stronger, and faster multi-person pose estimation model, ECCV, 1921. ,
Probabilistic methods for finding people, International Journal of Computer Vision, vol.43, issue.1, pp.45-68, 2001. ,
Latent structured models for human pose estimation, ICCV, vol.48, p.53, 2011. ,
Iterated second-order label sensitive pooling for 3D human pose estimation, CVPR, vol.45, p.53, 1924. ,
6M: Large scale datasets and predictive methods for 3D human sensing in natural environments, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.7, p.86, 2014. ,
Hand pose estimation via latent 2.5D heatmap regression, ECCV, vol.173, p.197, 2018. ,
Condensation-conditional density propagation for visual tracking, International Journal of Computer Vision, vol.29, issue.1, p.32, 1998. ,
First-person animal activity recognition from egocentric videos, ICPR, p.154, 2014. ,
Large pose 3D face reconstruction from a single image via direct volumetric CNN regression, ICCV, vol.65, p.67, 2017. ,
Towards understanding action recognition, ICCV, p.24, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00906902
3D convolutional neural networks for human action recognition, ICML, vol.99, p.100, 1998. ,
A large-scale RGB-D database for arbitrary-view human action recognition, ACMMM, vol.124, p.141, 2018. ,
Clustered pose and nonlinear appearance models for human pose estimation, BMVC, vol.20, p.74, 2010. ,
Cardboard people: a parameterized model of articulated image motion, International Conference on Automatic Face and Gesture Recognition, p.32, 1996. ,
Multi-view deep network for cross-view classification, CVPR, p.127, 2016. ,
End-to-end recovery of human shape and pose, CVPR, vol.63, p.177, 2018. ,
Learning category-specific mesh reconstruction from image collections, ECCV, vol.179, p.199, 2018. ,
Efficient feature extraction, encoding, and classification for action recognition, CVPR, vol.101, p.103, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01058734
Large-scale video classification with convolutional neural networks, CVPR, vol.155, p.164, 2014. ,
Neural 3D mesh renderer, CVPR, vol.176, p.179, 2018. ,
The Kinetics human action video dataset, vol.35, p.130, 2017. ,
A new representation of skeleton sequences for 3D action recognition, CVPR, vol.125, p.139, 2017. ,
Hand pose estimation and hand shape classification using multi-layered randomized decision forests, ECCV, vol.172, p.175, 2012. ,
,
A method for stochastic optimization. ICLR, vol.198, p.202, 2014. ,
A spatio-temporal descriptor based on 3D-gradients, BMVC, p.33, 2008. ,
Deeply learned view-invariant features for crossview action recognition, IEEE Transactions on Image Processing, vol.26, issue.6, p.125, 2017. ,
Human action recognition and prediction: A survey. CoRR, abs/1806.11230, p.124, 2018. ,
Depth sweep regression forests for estimating 3D human pose from images, BMVC, vol.64, p.87, 2014. ,
ImageNet classification with deep convolutional neural networks, NIPS, vol.34, p.163, 2012. ,
HMDB: a large video database for human motion recognition, ICCV, vol.153, p.155, 2011. ,
The language of actions: Recovering the syntax and semantics of goal-directed human activities, CVPR, vol.152, p.154, 2014. ,
Beyond Gaussian pyramid: Multi-skip feature stacking for action recognition, CVPR, p.116, 2015. ,
On space-time interest points, International Journal of Computer Vision, vol.64, issue.2-3, p.33, 2005. ,
Modeling and visual recognition of human actions and interactions. Habilitation à diriger des recherches en mathématiques et en informatique, Ecole normale supérieure, 2013. ,
URL : https://hal.archives-ouvertes.fr/tel-01064540
Learning realistic human actions from movies, CVPR, vol.33, p.151, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00548659
Unite the people: Closing the loop between 3D and 2D human representations, CVPR, vol.85, p.87, 2017. ,
Backpropagation applied to handwritten zip code recognition, Neural Computation, vol.1, issue.4, p.99, 1989. ,
Determination of 3d human body postures from a single view, vol.30, p.20, 1985. ,
Deep learning for detecting robotic grasps, The International Journal of Robotics Research, p.183, 2015. ,
Multi-view dynamic shape refinement using local temporal integration, ICCV, p.62, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01567758
Efficient implementation of marching cubes cases with topological guarantees, Journal of Graphics Tools, vol.8, issue.2, p.87, 2003. ,
3d human pose estimation from monocular images with deep convolutional neural network, ACCV, p.22, 2014. ,
Modeling the constraints of human hand motion, Proceedings of the Workshop on Human Motion, p.178, 2000. ,
Common objects in context, ECCV, p.20, 2014. ,
Deep convolutional neural fields for depth estimation from a single image, CVPR, vol.25, p.46, 2015. ,
Global context-aware attention LSTM networks for 3D action recognition, CVPR, vol.125, p.139, 2017. ,
Recognizing realistic actions from videos "in the wild, CVPR, vol.151, p.153, 2009. ,
Recognizing human actions by attributes, CVPR, p.34, 2011. ,
Cross-view action recognition via view knowledge transfer, CVPR, p.122, 2011. ,
Spatio-temporal LSTM with trust gates for 3D human action recognition, ECCV, vol.125, p.139, 2016. ,
Recognizing human actions as the evolution of pose estimation maps, CVPR, vol.125, p.139, 2018. ,
Enhanced skeleton visualization for view invariant human action recognition. Pattern Recogn, vol.68, p.139, 2017. ,
Core50: a new dataset and benchmark for continuous object recognition, Proceedings of the 1st Annual Conference on Robot Learning, Proceedings of Machine Learning Research, vol.192, p.204, 2017. ,
Fully convolutional networks for semantic segmentation, CVPR, p.24, 2015. ,
A skinned multi-person linear model, SIGGRAPH Asia, vol.177, p.184, 1992. ,
Motion and shape capture from sparse markers, SIGGRAPH Asia, vol.12, p.62, 2014. ,
Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision, vol.60, p.33, 2004. ,
, , p.118
Are spatial and global constraints really necessary for segmentation? In ICCV, p.24, 2011. ,
Graph distillation for action detection with privileged information, ECCV, vol.125, p.143, 2018. ,
2D/3D pose estimation and action recognition using multitask deep learning, CVPR, vol.66, p.143, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01815703
Partitioned sampling, articulated objects, and interface-quality hand tracking, ECCV, p.172, 2000. ,
Dex-Net 2.0: Deep learning to plan robust grasps with synthetic point clouds and analytic grasp metrics, p.183, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01801048
Object detection and segmentation from joint embedding of parts and pixels, ICCV, p.24, 2011. ,
DeepHPS: End-to-end estimation of 3D hand pose and shape by learning from synthetic depth, vol.3, p.175, 2018. ,
Initialization strategies of spatio-temporal convolutional neural networks, p.35, 2015. ,
Learning appearance in virtual scenarios for pedestrian detection, CVPR, vol.8, p.40, 2010. ,
Representation and recognition of the spatial organization of three-dimensional shapes, Royal Society of London B, vol.18, p.31, 1978. ,
Actions in context, CVPR, p.154, 2009. ,
A simple yet effective baseline for 3D human pose estimation, ICCV, vol.22, p.64, 2017. ,
Deep exemplar 2D-3D detection by adapting from real to rendered views, CVPR, p.127, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01800639
A 3D convolutional neural network for realtime object recognition, IROS, vol.63, p.176, 2015. ,
VNect: Real-time 3D human pose estimation with a single RGB camera, p.22, 2017. ,
Graspit! A versatile simulator for robotic grasping. Robotics Automation Magazine, vol.11, p.203, 2004. ,
English verbs of motion: a case study in semantics and lexical memory, Coding Processes and Human Memory, vol.29, p.30, 1972. ,
,
Visual Analysis of Humans: Looking at People, p.19, 2013. ,
Fast, minimum storage ray-triangle intersection, J. Graph. Tools, p.181, 1997. ,
3D human pose estimation from a single image via distance matrix regression, CVPR, p.22, 2017. ,
Real-time hand tracking under occlusion from an egocentric RGB-D sensor, p.175, 2017. ,
GANerated hands for real-time 3D hand tracking from monocular RGB, CVPR, vol.173, p.197, 2018. ,
From image sequences towards conceptual descriptions, Image and Vision Computing, vol.6, issue.2, pp.59-74, 1988. ,
Event models for recognition and natural language description of events in real-world image sequences, IJCAI, p.29, 1983. ,
Stacked hourglass networks for human pose estimation, ECCV, vol.84, p.92, 2016. ,
Associative embedding: End-to-end learning for joint detection and grouping, NIPS, 1921. ,
Learning motion representation for action recognition, In WACV, p.35, 2018. ,
Beyond short snippets: Deep networks for video classification, CVPR, vol.34, p.116, 2015. ,
Unsupervised learning of human action categories using spatial-temporal words, IJCV, vol.79, issue.3, p.97, 2008. ,
Analyzing gait with spatiotemporal surfaces, Motion of Non-Rigid and Articulated Obgects Workshop, vol.32, p.33, 1994. ,
Numerical Optimization, p.73, 2006. ,
Simplification and repair of polygonal models using volumetric techniques, IEEE Transactions on Visualization and Computer Graphics, vol.9, issue.2, p.67, 2003. ,
,
A large-scale benchmark dataset for event recognition in surveillance video, CVPR, p.152, 2011. ,
Efficient model-based 3D tracking of hand articulations using Kinect, BMVC, p.172, 2011. ,
Full DOF tracking of a hand interacting with an object by modeling occlusions and physical constraints, ICCV, p.176, 2011. ,
Tracking the articulated motion of two strongly interacting hands, CVPR, p.176, 2012. ,
Relevant feature selection for human pose estimation and localization in cluttered images, ECCV, vol.8, p.40, 2008. ,
Deep learning for human part discovery in images, ICRA, vol.46, p.52, 1925. ,
Neural body fitting: Unifying deep learning and model-based human pose and shape estimation, vol.3, p.28, 2018. ,
Model-based image analysis of human motion using constraint propagation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.2, issue.6, p.19, 1980. ,
Using a single RGB frame for real time 3D hand pose estimation in the wild, In WACV, vol.173, p.175, 2018. ,
Expressive body capture: 3D hands, face, and body from a single image, CVPR, vol.27, p.28, 1926. ,
Coarse-to-fine volumetric prediction for single-image 3D human pose, CVPR, vol.64, p.70, 2017. ,
Ordinal depth supervision for 3D human pose estimation, CVPR, 2018. ,
Learning to estimate 3D human pose and shape from a single color image, CVPR, vol.28, p.177, 2018. ,
Learning deep object detectors from 3D models, ICCV, p.39, 2015. ,
Recovery of nonrigid motion and structure, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.13, p.32, 1991. ,
Improving the Fisher kernel for largescale image classification, ECCV, vol.33, p.163, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00548630
Advancing Human Pose and Gesture Recognition, p.19, 2015. ,
Hand-object contact force estimation from markerless visual tracking, IEEE Transactions on Pattern Analysis and Machine Intelligence, p.176, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01356138
Detecting activities of daily living in first-person camera views, CVPR, vol.154, p.155, 2012. ,
Learning people detection models from few training samples, CVPR, vol.8, p.40, 2011. ,
Articulated people detection and pose estimation: Reshaping the future, CVPR, vol.8, p.40, 2012. ,
DeepCut: Joint subset partition and labeling for multi person pose estimation, CVPR, vol.21, p.62, 2016. ,
A model of dynamic human shape in motion, SIGGRAPH, p.44, 2015. ,
Deep multitask architecture for integrated 2D and 3D human sensing, CVPR, vol.22, p.66, 2017. ,
,
Generating human images and ground truth using computer graphics. Master's thesis, UCLA, p.40, 2016. ,
Feature mapping for learning fast and accurate 3D pose inference from synthetic images, CVPR, vol.127, p.134, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02506574
Learning a deep model for human action recognition from novel viewpoints, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.40, issue.3, p.125, 2018. ,
Learning a non-linear knowledge transfer model for crossview action recognition, CVPR, vol.40, p.138, 2015. ,
3D action recognition from novel viewpoints, CVPR, vol.40, p.41, 2016. ,
Histogram of oriented principal components for cross-view action recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.12, p.124, 2016. ,
Reconstructing 3D human pose from 2D image landmarks, ECCV, p.22, 2012. ,
Learning to parse images of articulated bodies, NIPS, 2006. ,
Strike a pose: Tracking people by finding stylized poses, CVPR, p.19, 2005. ,
Visual tracking of high dof articulated structures: an application to human hand tracking, ECCV, vol.172, p.175, 1994. ,
EgoCap: Egocentric marker-less motion capture with two fisheye cameras, SIGGRAPH Asia, p.41, 2016. ,
OctNetFusion: Learning depth fusion from data, vol.3, p.63, 2017. ,
Learning deep 3D representations at high resolutions, CVPR, p.63, 2017. ,
Civilian American and European Surface Anthropometry Resource (CAESAR), Final, vol.43, p.184, 2002. ,
Action mach a spatio-temporal maximum average correlation height filter for action recognition, CVPR, p.151, 2008. ,
MoCap-guided data augmentation for 3D pose estimation in the wild, NIPS, vol.40, p.87, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01389486
3D hand pose detection in egocentric RGB-D images, ECCV Workshop on Consumer Depth Cameras for Computer Vision, p.176, 2014. ,
First-person pose recognition using egocentric workspaces, CVPR, p.176, 2015. ,
Understanding everyday hands in action from RGB-D images, ICCV, p.176, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01237011
Localization-classificationregression for human pose, CVPR, vol.64, p.87, 2017. ,
Towards model-based recognition of human movements in image sequences, CVGIP: Image Understanding, vol.59, issue.1, p.31, 1994. ,
Human movement analysis based on explicit motion models, Motion-Based Recognition, p.31, 1997. ,
Coherent multi-sentence video description with variable level of detail, Pattern Recognition, vol.152, p.154, 2014. ,
A dataset for movie description, CVPR, vol.154, p.155, 0151. ,
A database for fine grained activity detection of cooking activities, CVPR, vol.152, p.155, 2012. ,
Hands in action: real-time 3D reconstruction of hands in interaction with objects, ICRA, vol.176, p.177, 2010. ,
, 2D human pose from optical flow, p.40, 2015.
Embodied hands: Modeling and capturing hands and bodies together, Proc. SIGGRAPH Asia), vol.36, p.184 ,
Learning to parse pictures of people, ECCV, vol.18, p.20, 2002. ,
URL : https://hal.archives-ouvertes.fr/inria-00545109
Convolutional networks for biomedical image segmentation, MICCAI, p.24, 2015. ,
Beyond sharing weights for deep domain adaptation, IEEE Transactions on Pattern Analysis and Machine Intelligence, p.127, 2018. ,
ImageNet large scale visual recognition challenge, International Journal of Computer Vision (IJCV), vol.115, issue.3, p.202, 2015. ,
Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities, ICCV, p.154, 2009. ,
Action bank: A high-level representation of activity in video, CVPR, p.34, 2012. ,
An overview of 3D object grasp synthesis algorithms, Robotics and Autonomous Systems, p.183, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00731127
Introduction to Modern Information Retrieval ,
MODEC: Multimodal decomposable models for human pose estimation, CVPR, vol.20, p.38, 2013. ,
Recognizing human actions: a local SVM approach, ICPR, vol.152, p.153, 1997. ,
A 3-dimensional SIFT descriptor and its application to action recognition, ACM International Conference on Multimedia, p.33, 2007. ,
Time-contrastive networks: Self-supervised learning from video, In ICRA, vol.127, p.137, 2018. ,
Motion-Based Recognition, p.29, 1997. ,
A large scale dataset for 3D human activity analysis, CVPR, vol.123, p.139, 2016. ,
Semantics of human behavior in image sequences, Computer Analysis of Human Behavior, pp.151-182, 2011. ,
Real-time human pose recognition in parts from a single depth image, CVPR, vol.41, p.175, 2011. ,
Learning image statistics for bayesian tracking, ICCV, p.31, 2001. ,
Stochastic tracking of 3d human figures using 2d image motion, ECCV, p.31, 2000. ,
Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion, International Journal of Computer Vision, vol.87, issue.1, p.23, 2010. ,
Combined discriminative and generative articulated pose and non-rigid shape estimation, NIPS, p.26, 2008. ,
Much ado about time: Exhaustive annotation of temporal data, vol.157, p.158, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01431527
Hollywood in homes: Crowdsourcing data collection for activity understanding, ECCV, p.13, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01418216
Actor and observer: Joint modeling of first and third-person videos, CVPR, vol.127, p.137, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01755547
Indoor segmentation and support inference from RGBD images, ECCV, p.25, 2012. ,
Hand keypoint detection in single images using multiview bootstrapping, CVPR, vol.173, p.175, 2017. ,
Natural image statistics and neural representation. Annual review of neuroscience, vol.24, p.159, 2001. ,
Very deep convolutional networks for large-scale image recognition, p.163, 2015. ,
Two-stream convolutional networks for action recognition in videos, NIPS, vol.115, p.163, 2014. ,
Learning joint top-down and bottom-up processes for 3D visual inference, CVPR, p.40, 2006. ,
Kinematic jump processes for monocular 3d human tracking, CVPR, p.31, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00548223
Unsupervised learning of human motion, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, issue.7, p.32, 2003. ,
UCF101: A dataset of 101 human actions classes from videos in the wild, vol.153, p.155, 2012. ,
Cross-modal deep variational hand pose estimation, CVPR, p.173, 2018. ,
Real-time joint tracking of a hand manipulating an object from RGB-D input, ECCV, vol.173, p.176, 2016. ,
Model-based 3D tracking of an articulated hand, CVPR, p.172, 2001. ,
Render for CNN: Viewpoint estimation in images using CNNs trained with rendered 3D model views, ICCV, p.39, 2015. ,
A point set generation network for 3D object reconstruction from a single image, CVPR, vol.63, p.176, 2017. ,
PointNet: Deep learning on point sets for 3D classification and segmentation, CVPR, p.63, 2017. ,
Deep neural networks with inexact matching for person re-identification, NIPS, p.127, 2016. ,
Human pose estimation via deep neural networks, CVPR, vol.20, p.63, 2014. ,
Learning spatiotemporal features with 3D convolutional networks, ICCV, vol.128, p.164, 2015. ,
A closer look at spatiotemporal convolutions for action recognition, CVPR, p.35, 2018. ,
Joint 3D tracking of a deformable object in interaction with a hand, ECCV, vol.176, p.177, 2018. ,
PhotoCity: training experts at large-scale image acquisition through a competitive game, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, p.154, 2011. ,
Multi-view supervision for singleview reconstruction via differentiable ray consistency, CVPR, p.68, 2017. ,
Self-supervised learning of motion capture, NIPS, vol.63, p.79, 1928. ,
On bodies and events, The Imitative Mind, p.97, 2002. ,
3d object reconstruction from hand-object interactions, ICCV, p.176, 2015. ,
Capturing hands in action using discriminative salient points and physics simulation, International Journal of Computer Vision, vol.118, issue.2, p.187, 2016. ,
Visualizing data using t-sne, Journal of Machine Learning Research, vol.9, p.159, 2008. ,
Learning from synthetic humans, CVPR, vol.84, p.184, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01505711
BodyNet: Volumetric inference of 3D human body shapes, ECCV, p.11, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01852169
Long-term temporal convolutions for action recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.40, issue.6, p.128, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01241518
On view-independent video representations for action recognition, p.11, 2019. ,
Sequence to sequence-video to text, ICCV, p.168, 2015. ,
Localizing and orienting street views using overhead imagery, ECCV, p.127, 2016. ,
Sparse inertial poser: Automatic 3D human pose estimation from sparse IMUs, Eurographics, p.62, 2017. ,
Tracking of persons in monocular image sequences, IEEE Nonrigid and Articulated Motion Workshop, p.31, 1997. ,
Dividing and aggregating network for multi-view action recognition, ECCV, vol.123, p.143, 2018. ,
Action recognition with improved trajectories, ICCV, vol.33, p.164, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00873267
Dense trajectories and motion boundary descriptors for action recognition, International Journal of Computer Vision, vol.103, issue.1, p.34, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00725627
Cross-view action modeling, learning, and recognition, CVPR, vol.124, p.131, 2014. ,
Action recognition with trajectory-pooled deepconvolutional descriptors, CVPR, vol.35, p.116, 2015. ,
Motionlets: Mid-level 3D parts for human motion recognition, CVPR, p.34, 2013. ,
Towards good practices for very deep two-stream convnets, vol.34, p.104, 2015. ,
Temporal segment networks: Towards good practices for deep action recognition, ECCV, vol.34, p.138, 2016. ,
Pixel2Mesh: Generating 3D mesh models from single RGB images, ECCV, vol.176, p.199, 2018. ,
, Octree-based convolutional neural networks for 3D shape analysis. SIGGRAPH, p.63, 2017.
, CVPR, p.116, 2016.
Non-local neural networks, CVPR, p.35, 2018. ,
Video-based hand manipulation capture through composite motion control, ACM Transactions on Graphics (TOG), vol.32, issue.4, p.176, 2013. ,
Learning to learn: Model regression networks for easy small sample learning, ECCV, p.127, 2016. ,
Convolutional pose machines, CVPR, vol.62, p.63, 2016. ,
Action recognition from arbitrary views using 3D exemplars, ICCV, p.124, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00544741
MarrNet: 3D shape reconstruction via 2.5D sketches, NIPS, p.176, 2017. ,
Capturing natural hand articulation, ICCV, p.172, 2001. ,
Parameterized modeling and recognition of activities, ICCV, p.32, 1998. ,
Perspective transformer nets: Learning single-view 3D object reconstruction without 3D supervision, NIPS, vol.63, p.68, 2016. ,
Estimation of human body shape in motion with wide clothing, ECCV, p.62, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01344795
Articulated pose estimation with flexible mixtures-ofparts, CVPR, p.19, 2011. ,
Articulated human detection with flexible mixtures of parts, IEEE Trans. Pattern Anal. Mach. Intell, vol.35, issue.12, p.19, 2013. ,
A dual-source approach for 3D pose estimation from a single image, CVPR, vol.22, p.87, 2016. ,
LIFT: Learned invariant feature transform, ECCV, p.127, 2016. ,
, Construction of a largescale image dataset using deep learning with humans in the loop, vol.45, p.185, 2015.
Learning semantic deformation flows with 3D convolutional networks, ECCV, p.63, 2016. ,
Learning to compare image patches via convolutional neural networks, CVPR, p.127, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01246261
Visualizing and understanding convolutional networks, ECCV, p.118, 2014. ,
Event-based analysis of video, CVPR, vol.32, p.33, 2001. ,
Real-time action recognition with enhanced motion vector CNNs, CVPR, p.100, 2016. ,
, 3D hand pose tracking and estimation using stereo matching, p.196, 2016.
View adaptive recurrent neural networks for high performance human action recognition from skeleton data, vol.123, p.125, 2017. ,
Learning view-invariant sparse representations for cross-view action recognition, ICCV, p.125, 2013. ,
Cross-view action recognition via transferable dictionary learning, IEEE Transactions on Image Processing, vol.25, issue.6, p.125, 2016. ,
Learning deep features for scene recognition using places database, NIPS, vol.97, p.151, 2014. ,
Sparseness meets deepness: 3D human pose estimation from monocular video, CVPR, vol.22, p.40, 2016. ,
Deep kinematic pose regression, ECCV Workshop on Geometry Meets Deep Learning, p.22, 2016. ,
Towards 3D human pose estimation in the wild: A weakly-supervised approach, ICCV, vol.62, p.64, 2017. ,
Action recognition with actons, ICCV, p.34, 2013. ,
Rethinking reprojection: Closing the loop for pose-aware shape reconstruction from a single image, ICCV, p.68, 2017. ,
Learning to estimate 3D hand pose from single RGB images, ICCV, vol.173, p.197, 2017. ,
The psycho-biology of language, p.159, 1935. ,
Bringing semantics into focus using visual abstraction, CVPR, p.155, 2013. ,
Chained multi-stream networks exploiting pose, motion, and appearance for action classification and detection, ICCV, vol.125, p.143, 2017. ,