People Watching: Human Actions as a Cue for Single View Geometry

David Fouhey 1 Vincent Delaitre 2 Abhinav Gupta 1 Alexei A. Efros 1 Ivan Laptev 3 Josef Sivic 3
3 WILLOW - Models of visual object recognition and scene understanding
CNRS - Centre National de la Recherche Scientifique : UMR8548, Inria Paris-Rocquencourt, DI-ENS - Département d'informatique de l'École normale supérieure
Abstract : We present an approach which exploits the coupling between human actions and scene geometry. We investigate the use of human pose as a cue for single-view 3D scene understanding. Our method builds upon recent advances in still-image pose estimation to extract functional and geometric constraints about the scene. These constraints are then used to improve state-of-the-art single-view 3D scene understanding approaches. The proposed method is validated on a collection of monocular time-lapse sequences collected from YouTube and a dataset of still images of indoor scenes. We demonstrate that observing people performing different actions can significantly improve estimates of 3D scene geometry.
Type de document :
Communication dans un congrès
Andrew Fitzgibbon and Svetlana Lazebnik and Pietro Perona and Yoichi Sato and Cordelia Schmid. ECCV'12 - 12th European Conference on Computer Vision, Oct 2012, Florence, Italy. Springer, 7576, pp.732-735, 2012, LNCS - Lecture Notes in Computer Science. 〈10.1007/978-3-642-33715-4_53〉
Liste complète des métadonnées

Littérature citée [39 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01060874
Contributeur : Vincent Delaitre <>
Soumis le : jeudi 4 septembre 2014 - 14:24:01
Dernière modification le : vendredi 25 mai 2018 - 12:02:06
Document(s) archivé(s) le : vendredi 5 décembre 2014 - 10:28:10

Fichier

fouhey_ECCV12.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

David Fouhey, Vincent Delaitre, Abhinav Gupta, Alexei A. Efros, Ivan Laptev, et al.. People Watching: Human Actions as a Cue for Single View Geometry. Andrew Fitzgibbon and Svetlana Lazebnik and Pietro Perona and Yoichi Sato and Cordelia Schmid. ECCV'12 - 12th European Conference on Computer Vision, Oct 2012, Florence, Italy. Springer, 7576, pp.732-735, 2012, LNCS - Lecture Notes in Computer Science. 〈10.1007/978-3-642-33715-4_53〉. 〈hal-01060874〉

Partager

Métriques

Consultations de la notice

3535

Téléchargements de fichiers

134