Abstract : We present an approach which exploits the coupling between human actions and scene geometry to use human pose as a cue for single-view 3D scene understanding. Our method builds upon recent advances in still-image pose estimation to extract functional and geometric constraints on the scene. These constraints are then used to improve single-view 3D scene understanding approaches. The proposed method is validated on monocular time-lapse sequences from YouTube and still images of indoor scenes gathered from the Internet. We demonstrate that observing people performing different actions can significantly improve estimates of 3D scene geometry.
https://hal.inria.fr/hal-01066257 Contributor : Minsu ChoConnect in order to contact the contributor Submitted on : Friday, September 19, 2014 - 2:28:58 PM Last modification on : Thursday, March 17, 2022 - 10:08:39 AM Long-term archiving on: : Friday, April 14, 2017 - 12:57:31 PM
David Fouhey, V. Delaitre, Abhinav Gupta, Alexei Efros, Ivan Laptev, et al.. People Watching: Human Actions as a Cue for Single View Geometry. International Journal of Computer Vision, Springer Verlag, 2014, pp.17. ⟨hal-01066257⟩