D. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg, Mapping the world's photos, Proceedings of the 18th international conference on World wide web, WWW '09, pp.761-770, 2009.
DOI : 10.1145/1526709.1526812

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177
URL : https://hal.archives-ouvertes.fr/inria-00548512

C. Doersch, A. Gupta, and A. A. Efros, Mid-level visual element discovery as discriminative mode seeking, Advances in Neural Information Processing Systems (NIPS), pp.494-502, 2013.

J. Fiss, A. Agarwala, and B. Curless, Candid portrait selection from video, ACM Transactions on Graphics (SIGGRAPH Asia), vol.30, issue.6, p.128, 2011.
DOI : 10.1145/2070781.2024162

J. Hays and A. Efros, Im2gps: estimating geographic information from a single image Bathroom: Closet: Shoe store: Casino: Office: Indoor Scenes: Cars Over Time, IEEE Conference Bedroom, 1920.

E. Kalogerakis, O. Vesselova, J. Hays, A. Efros, and A. Hertzmann, Image sequence geolocation with human travel priors, 2009 IEEE 12th International Conference on Computer Vision, pp.253-260, 2009.
DOI : 10.1109/ICCV.2009.5459259

J. Knopp, J. Sivic, and T. Pajdla, Avoiding Confusing Features in Place Recognition, European Conference on Computer Vision (ECCV), pp.748-761, 2010.
DOI : 10.1007/978-3-642-15549-9_54

Y. J. Lee, A. A. Efros, and M. Hebert, Style-Aware Mid-level Representation for Discovering Visual Connections in Space and Time, 2013 IEEE International Conference on Computer Vision, pp.1857-1864, 2013.
DOI : 10.1109/ICCV.2013.233

X. Li, C. Wu, C. Zach, S. Lazebnik, and J. Frahm, Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs, European Conference on Computer Vision (ECCV), pp.427-440, 2008.
DOI : 10.1007/978-3-540-88682-2_33

Y. Li, D. Crandall, and D. Huttenlocher, Landmark classification in large-scale image collections, IEEE 12th International Conference on Computer Vision (ICCV), pp.1957-1964, 2009.

P. Mueller, P. Wonka, S. Haegler, A. Ulmer, and L. Van-gool, Procedural modeling of buildings, ACM Transactions on Graphics, vol.25, issue.3, pp.614-623, 2006.
DOI : 10.1145/1141911.1141931

A. Oliva and A. Torralba, Chapter 2 Building the gist of a scene: the role of global image features in recognition, Progress in brain research, vol.155, pp.23-36, 2006.
DOI : 10.1016/S0079-6123(06)55002-2

K. Paik, The Art of Ratatouille, Chronicle Books, 2006.

T. Quack, B. Leibe, and L. Van-gool, World-scale mining of objects and events from community photo collections, Proceedings of the 2008 international conference on Content-based image and video retrieval, CIVR '08, pp.47-56, 2008.
DOI : 10.1145/1386352.1386363

B. C. Russell, A. A. Efros, J. Sivic, W. T. Freeman, and A. Zisserman, Using Multiple Segmentations to Discover Objects and their Extent in Image Collections, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.1605-1614, 2006.
DOI : 10.1109/CVPR.2006.326

G. Schindler, M. Brown, and R. Szeliski, City-Scale Location Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2007.
DOI : 10.1109/CVPR.2007.383150

A. Shrivastava, T. Malisiewicz, A. Gupta, and A. A. Efros, Data-driven visual similarity for cross-domain image matching, ACM Transactions on Graphics, issue.6, p.30154, 2011.

I. Simon, N. Snavely, and S. M. Seitz, Scene Summarization for Online Image Collections, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4408863

S. Singh, A. Gupta, and A. A. Efros, Unsupervised Discovery of Mid-Level Discriminative Patches, pp.73-86, 2012.
DOI : 10.1007/978-3-642-33709-3_6

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

O. Teboul, L. Simon, P. Koutsourakis, and N. Paragios, Segmentation of building facades using procedural shape priors, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3105-3112, 2010.
DOI : 10.1109/CVPR.2010.5540068
URL : https://hal.archives-ouvertes.fr/hal-00856063

A. Torralba and A. Oliva, Statistics of natural image categories, Network: Computation in Neural Systems, vol.14, issue.3, pp.391-412, 2003.
DOI : 10.1088/0954-898X_14_3_302