V. Eglin and S. Bres, Document page similarity based on layout visual saliency: application to query by example and document classification, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings., pp.1208-1212, 2003.
DOI : 10.1109/ICDAR.2003.1227849

C. Shin and D. Doermann, Document image retrieval based on layout structural similarity, IPCV, pp.606-612, 2006.

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, Computer Vision and Pattern Recognition, pp.248-255, 2009.

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, Int. Work. on Stat. Learning in Comp. Vision, 2004.

D. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999.
DOI : 10.1109/ICCV.1999.790410

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.121.4065

L. P. de las Heras, O. R. Terrades, J. Llados, D. Fernandez-Mota, and C. Canero, Use case visual Bag-of-Words techniques for camera based identity document classification, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp.721-725, 2015.
DOI : 10.1109/ICDAR.2015.7333856

J. Kumar and D. Doermann, Unsupervised Classification of Structurally Similar Document Images, 2013 12th International Conference on Document Analysis and Recognition, pp.1225-1229, 2013.
DOI : 10.1109/ICDAR.2013.248

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.359.8246

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, Proceedings of the European Conference on Computer Vision, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

H. Jégou, F. Perronnin, M. Douze, and C. Schmid, Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, 2012.
DOI : 10.1109/TPAMI.2011.235

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

H. E. Tasli, R. Sicre, T. Gevers, and A. A. Alatan, Geometry-constrained spatial pyramid adaptation for image classification, International Conference on Image Processing, 2014.

S. Chen, Y. Sun, and S. Naoi, Structured document classification by matching local salient features, Pattern Recognition (ICPR), International Conference on, pp.653-656, 2012.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, pp.1097-1105, 2012.

M. Oquab, L. Bottou, I. Laptev, and J. Sivic, Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.222

URL : https://hal.archives-ouvertes.fr/hal-00911179

B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva, Learning Deep Features for Scene Recognition using Places Database, Advances in Neural Information Processing Systems, 2014.

Y. Gong, L. Wang, R. Guo, and S. Lazebnik, Multi-scale Orderless Pooling of Deep Convolutional Activation Features, Proceedings of the European Conference on Computer Vision, 2014.
DOI : 10.1007/978-3-319-10584-0_26

URL : http://arxiv.org/abs/1403.1840

R. Arandjelović, P. Gronat, A. Torii, T. Pajdla, and J. Sivic, NetVLAD: CNN Architecture for Weakly Supervised Place Recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
DOI : 10.1109/CVPR.2016.572

L. Liu, C. Shen, L. Wang, A. van den Hengel, and C. Wang, Encoding high dimensional local features by sparse coding based Fisher vectors, Advances in Neural Information Processing Systems, pp.1143-1151, 2014.

M. Cimpoi, S. Maji, and A. Vedaldi, Deep filter banks for texture recognition and segmentation, Proceedings of the IEEE CVPR, pp.3828-3836, 2015.
DOI : 10.1109/cvpr.2015.7299007

URL : https://hal.archives-ouvertes.fr/hal-01263622

P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, Object detection with discriminatively trained part-based models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1627-1645, 2010.

R. Sicre and F. Jurie, Discriminative part model for visual recognition, Computer Vision and Image Understanding, vol.141, pp.28-37, 2015.
DOI : 10.1016/j.cviu.2015.08.002

URL : https://hal.archives-ouvertes.fr/hal-01132389

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.1-9, 2015.
DOI : 10.1109/CVPR.2015.7298594

URL : http://arxiv.org/abs/1409.4842

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2015.

G. Tolias, R. Sicre, and H. Jégou, Particular object retrieval with integral max-pooling of CNN activations, 2016.

R. Sicre and H. Jégou, Memory Vectors for Particular Object Retrieval with Multiple Queries, Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, ICMR '15, pp.479-482, 2015.
DOI : 10.1145/2671188.2749306

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, Return of the Devil in the Details: Delving Deep into Convolutional Nets, Proceedings of the British Machine Vision Conference 2014, 2014.
DOI : 10.5244/C.28.6