R. Arandjelovi´carandjelovi´c and A. Zisserman, Smooth object retrieval using a bag of boundaries, ICCV, 2011.

M. Aubry, B. Russell, and J. Sivic, Painting-to-3D model alignment via discriminative visual elements, ACM Transactions on Graphics, vol.33, issue.2, 2004.
DOI : 10.1145/2591009

URL : https://hal.archives-ouvertes.fr/hal-00863615

G. Baatz, O. Saurer, K. Köser, and M. Pollefeys, Large Scale Visual Geo-Localization of Images in Mountainous Terrain, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_37

L. Bourdev and J. Malik, Poselets: Body part detectors trained using 3D human pose annotations, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459303

W. Choi, Y. Chao, C. Pantofaru, and S. Savarese, Understanding Indoor Scenes Using 3D Geometric Phrases, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.12

O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman, Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408891

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

T. Dean, M. Ruzon, M. Segal, J. Shlens, S. Vijayanarasimhan et al., Fast, Accurate Detection of 100,000 Object Classes on a Single Machine, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.237

L. Del-pero, J. Bowdish, B. Kermgard, E. Hartley, and K. Barnard, Understanding Bayesian Rooms Using Composite 3D Object Models, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.27

C. Doersch, A. Gupta, and A. A. Efros, Mid-level visual element discovery as discriminative mode seeking, NIPS, 2013.

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, pp.303-338, 2010.
DOI : 10.1007/s11263-009-0275-4

P. Felzenszwalb, R. Girshick, D. Mcallester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2007.
DOI : 10.1109/TPAMI.2009.167

S. Fidler, S. Dickinson, and R. Urtasun, 3D object detection and viewpoint estimation with a deformable 3D cuboid model, NIPS, 2012.

M. Gharbi, T. Malisiewicz, S. Paris, and F. Durand, A Gaussian approximation of feature space for fast image similarity, p.2012

D. Glasner, M. Galun, S. Alpert, R. Basri, and G. Shakhnarovich, Viewpoint-aware object detection and pose estimation, ICCV, 2011.

A. Gupta, A. A. Efros, and M. Hebert, Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics, ECCV, 2010.
DOI : 10.1007/978-3-642-15561-1_35

B. Hariharan, J. Malik, and D. Ramanan, Discriminative Decorrelation for Clustering and Classification, ECCV, 2012.
DOI : 10.1007/978-3-642-33765-9_33

M. Hejrati and D. Ramanan, Analyzing 3D objects in cluttered images, NIPS, 2012

D. P. Huttenlocher and S. Ullman, Object recognition using alignment, ICCV, 1987.

A. Jain, A. Gupta, M. Rodriguez, and L. S. Davis, Representing Videos Using Mid-level Discriminative Patches, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.332

M. Juneja, A. Vedaldi, C. V. Jawahar, and A. Zisserman, Blocks That Shout: Distinctive Parts for Scene Classification, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.124

Y. Li, N. Snavely, D. Huttenlocher, and P. Fua, Worldwide pose estimation using 3D point clouds, ECCV, 2012.

J. Lim, H. Pirsiavash, and A. Torralba, Parsing IKEA Objects: Fine Pose Estimation, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.372

D. Lowe, The viewpoint consistency constraint, International Journal of Computer Vision, vol.171, issue.1, pp.57-72, 1987.
DOI : 10.1007/BF00128526

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

T. Malisiewicz, A. Gupta, and A. A. Efros, Ensemble of exemplar-SVMs for object detection and beyond, 2011 International Conference on Computer Vision, 2006.
DOI : 10.1109/ICCV.2011.6126229

J. L. Mundy, Object Recognition in the Geometric Era: A Retrospective, Toward Category-Level Object Recognition, pp.3-29, 2006.
DOI : 10.1007/11957959_1

B. Pepik, M. Stark, P. Gehler, and B. Schiele, Teaching 3D geometry to deformable part models, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248075

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

L. Roberts, Machine perception of 3-D solids, 1965.

F. Rothganger, S. Lazebnik, C. Schmid, and J. Ponce, 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., 2003.
DOI : 10.1109/CVPR.2003.1211480

URL : https://hal.archives-ouvertes.fr/inria-00548224

S. Satkin, J. Lin, and M. Hebert, Data-Driven Scene Understanding from 3D Models, Procedings of the British Machine Vision Conference 2012, 2012.
DOI : 10.5244/C.26.128

T. Sattler, B. Leibe, and L. Kobbelt, Fast image-based localization using direct 2D-to-3D matching, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126302

S. Singh, A. Gupta, and A. A. Efros, Unsupervised Discovery of Mid-Level Discriminative Patches, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_6

P. Viola and M. Jones, Rapid object detection using a boosted cascade of simple classifiers, CVPR, 2001.

Y. Xiang and S. Savarese, Estimating the aspect layout of object categories, 2012 IEEE Conference on Computer Vision and Pattern Recognition
DOI : 10.1109/CVPR.2012.6248081

J. Xiao, B. Russell, and A. Torralba, Localizing 3D cuboids in single-view images, NIPS, 2012.

J. Yagnik, D. Strelow, D. Ross, and R. Lin, The power of comparative reasoning, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126527

M. Zia, M. Stark, B. Schiele, and K. Schindler, Detailed 3D Representations for Object Recognition and Modeling, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.11
DOI : 10.1109/TPAMI.2013.87