BodyNet: Volumetric Inference of 3D Human Body Shapes

Gül Varol 1, 2, Duygu Ceylan 3, Bryan Russell 3, Jimei Yang 3, Ersin Yumer 3, Ivan Laptev 1, Cordelia Schmid 2
1 WILLOW - Models of visual object recognition and scene understanding
DI-ENS - Département d'informatique de l'École normale supérieure, Inria de Paris
2 Thoth - Learning models from massive data
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann
Abstract: Human shape estimation is an important task for video editing, animation, and the fashion industry. Predicting 3D human body shape from natural images, however, is highly challenging due to factors such as variation in human bodies, clothing, and viewpoint. Prior methods addressing this problem typically attempt to fit parametric body models with certain priors on pose and shape. In this work we argue for an alternative representation and propose BodyNet, a neural network for direct inference of volumetric body shape from a single image. BodyNet is an end-to-end trainable network that benefits from (i) a volumetric 3D loss, (ii) a multi-view re-projection loss, and (iii) intermediate supervision of 2D pose, 2D body part segmentation, and 3D pose. Our experiments demonstrate that each of these components improves performance. To evaluate the method, we fit the SMPL model to our network output and show state-of-the-art results on the SURREAL and Unite the People datasets, outperforming recent approaches. Besides achieving state-of-the-art performance, our method also enables volumetric body-part segmentation.
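As a rough illustration of how a volumetric 3D loss and a multi-view re-projection loss can be combined, the sketch below scores a predicted voxel occupancy grid against a binary target. The abstract does not specify the loss formulation, so everything here is an assumption made for illustration: binary cross-entropy as the per-voxel loss, a max along one axis as the projection to a silhouette, the choice of axes for the "front" and "side" views, and the weights `w_front` and `w_side`. All function names are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def binary_cross_entropy(pred, target, eps=1e-7):
    """Mean binary cross-entropy between predicted occupancies in [0, 1]
    and binary targets. Assumed loss form, not confirmed by the abstract."""
    pred = np.clip(pred, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(target * np.log(pred)
                          + (1.0 - target) * np.log(1.0 - pred)))

def silhouette(volume, axis):
    """Project a voxel grid to a 2D silhouette by taking the max along one axis
    (an assumed orthographic projection)."""
    return volume.max(axis=axis)

def combined_loss(pred, target, w_front=0.1, w_side=0.1):
    """Volumetric loss plus two re-projection terms.
    The weights and the axis-to-view mapping are hypothetical."""
    loss_3d = binary_cross_entropy(pred, target)
    loss_front = binary_cross_entropy(silhouette(pred, axis=2),
                                      silhouette(target, axis=2))
    loss_side = binary_cross_entropy(silhouette(pred, axis=0),
                                     silhouette(target, axis=0))
    return loss_3d + w_front * loss_front + w_side * loss_side

# Toy example: a 2x2x2 occupied cube inside a 4x4x4 grid.
target = np.zeros((4, 4, 4))
target[1:3, 1:3, 1:3] = 1.0
good = np.where(target == 1.0, 0.9, 0.1)  # confident, correct prediction
bad = np.full((4, 4, 4), 0.5)             # uninformative prediction

print(combined_loss(good, target) < combined_loss(bad, target))
```

In this toy setup the confident, correct prediction receives a lower combined loss than the uninformative one, since the re-projection terms penalize silhouettes that disagree with the target in either view.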
Document type: Conference paper

https://hal.inria.fr/hal-01852169
Contributor: Gul Varol
Submitted on: Saturday, August 18, 2018 - 2:00:48 PM
Last modification on: Thursday, February 7, 2019 - 4:21:59 PM

File

VarolECCV2018.pdf
Files produced by the author(s)

Citation

Gül Varol, Duygu Ceylan, Bryan Russell, Jimei Yang, Ersin Yumer, et al. BodyNet: Volumetric Inference of 3D Human Body Shapes. ECCV 2018 - 15th European Conference on Computer Vision, Sep 2018, Munich, Germany. pp.20-38, ⟨10.1007/978-3-030-01234-2_2⟩. ⟨hal-01852169⟩
