Conference papers

Monocular Human Motion Capture with a Mixture of Regressors

Ankur Agarwal 1, Bill Triggs 1
1 LEAR - Learning and recognition in vision
GRAVIR - IMAG - Laboratoire d'informatique GRAphique, VIsion et Robotique de Grenoble, Inria Grenoble - Rhône-Alpes, CNRS - Centre National de la Recherche Scientifique : FR71
Abstract : We address 3D human motion capture from monocular images, taking a learning-based approach to construct a probabilistic pose estimation model from a set of labelled human silhouettes. To compensate for ambiguities in the pose reconstruction problem, our model explicitly calculates several possible pose hypotheses. It uses locality on a manifold in the input space and connectivity in the output space to identify regions of multi-valuedness in the mapping from silhouette to 3D pose. This information is used to fit a mixture of regressors on the input manifold, giving us a global model capable of predicting the possible poses with corresponding probabilities. These are then used in a dynamical-model-based tracker that automatically detects tracking failures and re-initializes in a probabilistically correct manner. The system is trained on conventional motion capture data, using both the corresponding real human silhouettes and silhouettes synthesized artificially from several different models for improved robustness to inter-person variations. Static pose estimation is illustrated on a variety of silhouettes. The robustness of the method is demonstrated by tracking on a real image sequence requiring multiple automatic re-initializations.
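As a rough illustration of what "predicting the possible poses with corresponding probabilities" can look like, the following minimal sketch evaluates a mixture of linear regressors at one input. It is not the authors' implementation: the isotropic Gaussian gating, the function name `predict_pose_hypotheses`, and the toy parameters are all assumptions made for this example, and the inputs stand in for silhouette descriptors that the abstract does not specify.

```python
import numpy as np

def predict_pose_hypotheses(x, priors, centers, variances, weights, biases):
    """Evaluate a mixture of K linear regressors at input x (hypothetical sketch).

    x:         (d,)      input descriptor
    priors:    (K,)      mixing proportions
    centers:   (K, d)    gate centres in input space (assumed Gaussian gates)
    variances: (K,)      isotropic gate variances (assumed)
    weights:   (K, m, d) linear map of each regressor
    biases:    (K, m)    offset of each regressor
    Returns (probs, preds): (K,) gate probabilities and (K, m) pose hypotheses.
    """
    x = np.asarray(x, dtype=float)
    d = x.shape[0]
    # Log of prior times isotropic Gaussian density for each gate.
    sq_dist = np.sum((centers - x) ** 2, axis=1)
    log_gate = (np.log(priors)
                - 0.5 * sq_dist / variances
                - 0.5 * d * np.log(2.0 * np.pi * variances))
    # Normalise in a numerically stable way.
    log_gate -= log_gate.max()
    probs = np.exp(log_gate)
    probs /= probs.sum()
    # Each regressor proposes its own output for the same input.
    preds = np.einsum("kmd,d->km", weights, x) + biases
    return probs, preds

# Toy ambiguous case: two regressors share a gate region but disagree on the output.
priors = np.array([0.5, 0.5])
centers = np.zeros((2, 2))
variances = np.array([1.0, 1.0])
weights = np.array([[[1.0, 0.0]], [[-1.0, 0.0]]])
biases = np.zeros((2, 1))

probs, preds = predict_pose_hypotheses(
    np.array([2.0, 0.0]), priors, centers, variances, weights, biases)
```

In this toy setting both regressors receive the same probability while proposing opposite outputs, which is the kind of multi-valued prediction a downstream tracker would have to disambiguate.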
Document type : Conference papers
Contributor : Thoth Team
Submitted on : Monday, December 20, 2010 - 9:09:02 AM
Last modification on : Monday, December 28, 2020 - 3:44:02 PM
Long-term archiving on : Monday, March 21, 2011 - 3:08:41 AM





Ankur Agarwal, Bill Triggs. Monocular Human Motion Capture with a Mixture of Regressors. IEEE Workshop on Vision for Human Computer Interaction at Computer Vision and Pattern Recognition (CVPR '05), Jun 2005, San Diego, United States. pp.72, ⟨10.1109/CVPR.2005.496⟩. ⟨inria-00548522⟩


