Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson; Bugra Tekin; Federica Bogo; Ivan Laptev; Marc Pollefeys; Cordelia Schmid

doi:10.1109/CVPR42600.2020.00065

Conference paper, Year: 2020

Yana Hasson

Role: Author

Models of visual object recognition and scene understanding

Bugra Tekin

Role: Author

Microsoft Research

Federica Bogo

Role: Author

Microsoft Research

Ivan Laptev

Role: Author

Models of visual object recognition and scene understanding

Marc Pollefeys

Role: Author

Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich]

Cordelia Schmid

Role: Author
PersonId : 831154

Apprentissage de modèles à partir de données massives (Learning models from massive data)

Abstract

Modeling hand-object manipulations is essential for understanding how humans interact with their environment. While of practical importance, estimating the pose of hands and objects during interactions is challenging due to the large mutual occlusions that occur during manipulation. Recent efforts have been directed towards fully-supervised methods that require large amounts of labeled training samples. Collecting 3D ground-truth data for hand-object interactions, however, is costly, tedious, and error-prone. To overcome this challenge we present a method to leverage photometric consistency across time when annotations are only available for a sparse subset of frames in a video. Our model is trained end-to-end on color images to jointly reconstruct hands and objects in 3D by inferring their poses. Given our estimated reconstructions, we differentiably render the optical flow between pairs of adjacent images and use it within the network to warp one frame to another. We then apply a self-supervised photometric loss that relies on the visual consistency between nearby images. We achieve state-of-the-art results on 3D hand-object reconstruction benchmarks and demonstrate that our approach allows us to improve the pose estimation accuracy by leveraging information from neighboring frames in low-data regimes.
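The self-supervised photometric loss described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: in the paper the optical flow is rendered differentiably from the estimated hand and object reconstructions inside the network, whereas here the flow is simply passed in as an array. The function names `bilinear_sample` and `photometric_loss`, the `(dx, dy)` flow layout, the optional `mask` argument, and the choice of a mean absolute difference are all assumptions made for illustration.

```python
import numpy as np


def bilinear_sample(image, xs, ys):
    """Sample an (H, W, C) image at float pixel coordinates xs, ys of shape (H, W).

    Coordinates are expected to lie inside the image (H and W must be >= 2).
    """
    h, w = image.shape[:2]
    # Top-left integer corner of each sampling cell, kept one pixel from the border.
    x0 = np.clip(np.floor(xs).astype(int), 0, w - 2)
    y0 = np.clip(np.floor(ys).astype(int), 0, h - 2)
    wx = (xs - x0)[..., None]  # horizontal interpolation weight in [0, 1]
    wy = (ys - y0)[..., None]  # vertical interpolation weight in [0, 1]
    top = (1 - wx) * image[y0, x0] + wx * image[y0, x0 + 1]
    bottom = (1 - wx) * image[y0 + 1, x0] + wx * image[y0 + 1, x0 + 1]
    return (1 - wy) * top + wy * bottom


def photometric_loss(frame_a, frame_b, flow_ab, mask=None):
    """Mean absolute color difference between frame_a and frame_b warped onto it.

    flow_ab[y, x] = (dx, dy) is the assumed displacement of pixel (x, y)
    from frame_a to frame_b. `mask` is an optional boolean (H, W) array,
    e.g. the pixels covered by the reconstructed hand and object (assumption).
    """
    h, w = frame_a.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w].astype(float)
    tx = xs + flow_ab[..., 0]
    ty = ys + flow_ab[..., 1]
    # Only pixels whose flow target falls inside frame_b contribute.
    valid = (tx >= 0) & (tx <= w - 1) & (ty >= 0) & (ty <= h - 1)
    if mask is not None:
        valid &= mask
    warped = bilinear_sample(frame_b, np.clip(tx, 0, w - 1), np.clip(ty, 0, h - 1))
    diff = np.abs(warped - frame_a).mean(axis=-1)
    return float((diff * valid).sum() / max(valid.sum(), 1))


# Toy check: frame_b is frame_a shifted 2 pixels to the right.
rng = np.random.default_rng(0)
frame_a = rng.random((8, 10, 3))
frame_b = np.roll(frame_a, 2, axis=1)
true_flow = np.zeros((8, 10, 2))
true_flow[..., 0] = 2.0
loss_true = photometric_loss(frame_a, frame_b, true_flow)
loss_zero = photometric_loss(frame_a, frame_b, np.zeros((8, 10, 2)))
print(loss_true, loss_zero)
```

In this toy setup the loss is near zero when the supplied flow matches the true shift and clearly larger when the flow is wrong. That gap is the signal such a loss provides on frames that have no pose annotation, assuming the flow itself is derived from the predicted poses.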

Domains

Computer Vision and Pattern Recognition [cs.CV]

Main file

cvpr20yana.pdf (11.21 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-02557112

Submitted on: Tuesday, 28 April 2020, 14:40:53

Last modified on: Saturday, 27 April 2024, 03:12:32

Dates and versions

hal-02557112 , version 1 (28-04-2020)

Identifiers

HAL Id : hal-02557112 , version 1
DOI : 10.1109/CVPR42600.2020.00065

Cite

Yana Hasson, Bugra Tekin, Federica Bogo, Ivan Laptev, Marc Pollefeys, et al. Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction. CVPR 2020 - IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle / Virtual, United States. pp.568-577, ⟨10.1109/CVPR42600.2020.00065⟩. ⟨hal-02557112⟩

Collections

ENS-PARIS UNIV-RENNES1 UGA CNRS INRIA IRISA INSMI LJK LJK_GI INRIA2 LJK-GI-THOTH PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR PRAIRIE-IA UR1-MATH-NUM

177 views

221 downloads
