J. Gibson, The ecological approach to visual perception, 1979.

M. Andriluka, S. Roth, and B. Schiele, Pictorial structures revisited: People detection and articulated pose estimation, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206754

Y. Yang and D. Ramanan, Articulated pose estimation using flexible mixtures of parts, In: CVPR, 2011.

S. Johnson and M. Everingham, Learning effective human pose estimation from inaccurate annotation, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995318

C. J. Taylor, Reconstruction of articulated objects from point correspondences in a single image, In: CVPR, 2000.

V. Hedau, D. Hoiem, and D. Forsyth, Recovering the spatial layout of cluttered rooms, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459411

D. Hoiem, A. Efros, and M. Hebert, Putting objects in perspective, IJCV, 2008.

V. Hedau, D. Hoiem, and D. Forsyth, Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry, 2010.
DOI : 10.1007/978-3-642-15567-3_17

S. X. Yu, H. Zhang, and J. Malik, Inferring spatial layout from a single image via depth-ordered grouping, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2008.
DOI : 10.1109/CVPRW.2008.4562977

D. Lee, A. Gupta, M. Hebert, and T. Kanade, Estimating spatial layout of rooms using volumetric reasoning about objects and surfaces, In: NIPS, 2010.

D. Hoiem, A. Efros, and M. Hebert, Geometric context from a single image, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.107

H. Wang, S. Gould, and D. Koller, Discriminative learning with latent variables for cluttered indoor scene understanding, Communications of the ACM, vol.56, issue.4, 2010.
DOI : 10.1145/2436256.2436276

A. Gupta, A. Efros, and M. Hebert, Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics, 2010.
DOI : 10.1007/978-3-642-15561-1_35
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.170.6991

O. Barinova, V. Lempitsky, E. Tretyak, and P. Kohli, Geometric Image Parsing in Man-Made Environments, 2010.
DOI : 10.1007/978-3-642-15552-9_5

L. Del Pero, J. Guan, E. Brau, J. Schlecht, and K. Barnard, Sampling bedrooms, 2011.

N. Payet and S. Todorovic, Scene shape from texture of objects, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995326

A. Schwing, T. Hazan, M. Pollefeys, and R. Urtasun, Efficient structured prediction for 3D indoor scene understanding, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248006

L. Del Pero, J. Bowdish, D. Fried, B. Kermgard, E. Hartley et al., Bayesian geometric modeling of indoor scenes, 2012.

A. Gupta and L. S. Davis, Objects in Action: An Approach for Combining Action Understanding and Object Perception, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383331

M. Turek, A. Hoogs, and R. Collins, Unsupervised Learning of Functional Categories in Video Scenes, In: ECCV, 2010.
DOI : 10.1007/978-3-642-15552-9_48

V. Delaitre, J. Sivic, and I. Laptev, Learning person-object interactions for action recognition in still images, In: NIPS, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00648156

A. Prest, C. Schmid, and V. Ferrari, Weakly Supervised Learning of Interactions between Humans and Objects, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.3, 2011.
DOI : 10.1109/TPAMI.2011.158
URL : https://hal.archives-ouvertes.fr/inria-00516477

J. Gall, A. Fossati, and L. Van Gool, Functional categorization of objects using real-time markerless motion capture, In: CVPR, 2011.

H. Kjellstrom, J. Romero, D. Martinez, and D. Kragic, Simultaneous Visual Recognition of Manipulation Actions and Manipulated Objects, In: ECCV, 2008.
DOI : 10.1007/978-3-540-88688-4_25

C. Desai, D. Ramanan, and C. Fowlkes, Discriminative models for static human-object interactions, 2010.

B. Yao, A. Khosla, and L. Fei-Fei, Classifying actions and measuring action similarity by modeling the mutual context of objects and human poses, Proc. ICML, 2011.

A. Gupta, T. Chen, F. Chen, D. Kimber, and L. Davis, Context and observation driven latent variable model for human pose estimation, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587511

A. Gupta, S. Satkin, A. Efros, and M. Hebert, From 3D scene geometry to human workspace, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995448

L. Bourdev and J. Malik, Poselets: Body part detectors trained using 3D human pose annotations, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459303

P. Felzenszwalb, D. McAllester, and D. Ramanan, A discriminatively trained, multiscale, deformable part model, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587597

L. Guan, J.-S. Franco, and M. Pollefeys, 3D occlusion inference from silhouette cues, In: CVPR, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00348980

N. Krahnstoever and P. R. Mendonca, Bayesian autocalibration for surveillance, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.44

D. Rother, K. Patwardhan, and G. Sapiro, What Can Casual Walkers Tell Us About A 3D Scene?, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4409082

A. Schodl and I. Essa, Depth layers from occlusions, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, 2001.
DOI : 10.1109/CVPR.2001.990534

J. Coughlan and A. Yuille, The Manhattan world assumption: Regularities in scene statistics which enable Bayesian inference, In: NIPS, 2000.

D. Lee, M. Hebert, and T. Kanade, Geometric reasoning for single image structure recovery, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206872

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

V. Hedau, D. Hoiem, and D. Forsyth, Recovering free space of indoor scenes from a single image, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248005

V. Delaitre, D. Fouhey, I. Laptev, J. Sivic, A. Efros et al., Scene Semantics from Long-Term Observation of People, In: ECCV, 2012.
DOI : 10.1007/978-3-642-33783-3_21
URL : https://hal.archives-ouvertes.fr/hal-01060880