Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features

Résumé

We propose to improve on graph convolution based approaches for human shape and pose estimation from monocular input, using pixel-aligned local image features. Given a single input color image, existing graph convolutional network (GCN) based techniques for human shape and pose estimation (e.g. [19]) use a single convolutional neural network (CNN) generated global image feature appended to all mesh vertices equally to initialize the GCN stage, which transforms a template T-posed mesh into the target pose. In contrast, we propose for the first time the idea of using local image features per vertex. These features are sampled from the CNN image feature maps by utilizing pixel-to-mesh correspondences generated with DensePose [11]. Our quantitative and qualitative results on standard benchmarks show that using local features improves on global ones and leads to competitive performances with respect to the state-of-the-art.
Fichier principal
Vignette du fichier
Monocular_Human_Shape_and_Pose_with_Dense_Mesh-borne_Local_Image_Features.pdf (14.68 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03524051 , version 1 (13-01-2022)

Identifiants

Citer

Shubhendu Jena, Franck Multon, Adnane Boukhayma. Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features. FG 2021 - IEEE International Conference on Automatic Face and Gesture Recognition, Dec 2021, Jodhpur (online), India. pp.1-5, ⟨10.1109/FG52635.2021.9666993⟩. ⟨hal-03524051⟩
93 Consultations
95 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More