J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual Categorization with Bags of Keypoints, ECCV Workshop on Statistical Learning in Computer Vision, pp.1-22, 2004.

D. G. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999.
DOI : 10.1109/ICCV.1999.790410
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.121.4065

K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.10, pp.1615-1630, 2005.
DOI : 10.1109/TPAMI.2005.188
URL : https://hal.archives-ouvertes.fr/inria-00548227

K. E. Van-de-sande, T. Gevers, and C. G. Snoek, A comparison of color features for visual concept classification, Proceedings of the 2008 international conference on Content-based image and video retrieval, CIVR '08, pp.141-149, 2008.
DOI : 10.1145/1386352.1386376

J. C. Van-gemert, J. M. Geusebroek, C. J. Veenman, and A. W. Smeulders, Kernel Codebooks for Scene Categorization, pp.696-709, 2008.
DOI : 10.1007/978-3-540-88690-7_52

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635

P. Koniusz and K. Mikolajczyk, Soft assignment of visual words as Linear Coordinate Coding and optimisation of its reconstruction error, 2011 18th IEEE International Conference on Image Processing, 2011.
DOI : 10.1109/ICIP.2011.6116129

P. Koniusz, F. Yan, and K. Mikolajczyk, Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection, Computer Vision and Image Understanding, vol.117, issue.5, 2012.
DOI : 10.1016/j.cviu.2012.10.010

H. Lee, A. Battle, R. Raina, and A. Y. Ng, Efficient Sparse Coding Algorithms, NIPS, pp.801-808, 2007.

J. Yang, K. Yu, Y. Gong, and T. S. Huang, Linear Spatial Pyramid Matching using Sparse Coding for Image Classification, pp.1794-1801, 2009.

K. Yu, T. Zhang, and Y. Gong, Nonlinear Learning using Local Coordinate Coding, 2009.

J. Wang, J. Yang, K. Yu, F. Lv, T. Huang et al., Locality-constrained Linear Coding for Image Classification, 2010.

S. Gao, I. W. Tsang, L. Chia, and P. Zhao, Local features are not lonely – Laplacian sparse coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539943

X. Zhou, K. Yu, T. Zhang, and T. S. Huang, Image Classification Using Super-Vector Coding of Local Image Descriptors, pp.141-154, 2010.
DOI : 10.1007/978-3-642-15555-0_11

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3304-3311, 2010.
DOI : 10.1109/CVPR.2010.5540039
URL : https://hal.archives-ouvertes.fr/inria-00548637

F. Perronnin and C. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383266

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, pp.143-156, 2010.
DOI : 10.1007/978-3-642-15561-1_11
URL : https://hal.archives-ouvertes.fr/inria-00548630

R. Negrel, D. Picard, and P. Gosselin, Compact tensor based image representation for similarity search, 2012 19th IEEE International Conference on Image Processing, 2012.
DOI : 10.1109/ICIP.2012.6467387
URL : https://hal.archives-ouvertes.fr/hal-00753157

S. Boughorbel, J. Tarel, and N. Boujemaa, Generalized histogram intersection kernel for image recognition, IEEE International Conference on Image Processing 2005, pp.161-164, 2005.
DOI : 10.1109/ICIP.2005.1530353

H. Jégou, M. Douze, and C. Schmid, On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.1169-1176, 2009.
DOI : 10.1109/CVPR.2009.5206609

J. Yang, Y. Jiang, A. G. Hauptmann, and C. Ngo, Evaluating bag-of-visual-words representations in scene classification, Proceedings of the international workshop on multimedia information retrieval, MIR '07, pp.197-206, 2007.
DOI : 10.1145/1290082.1290111

K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Proceedings of the British Machine Vision Conference 2011, 2011.
DOI : 10.5244/C.25.76

A. Coates and A. Ng, The Importance of Encoding Versus Training with Sparse Coding and Vector Quantization, pp.921-928, 2011.

Y. Boureau, F. Bach, Y. LeCun, and J. Ponce, Learning mid-level features for recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539963

Y. Boureau, J. Ponce, and Y. LeCun, A Theoretical Analysis of Feature Pooling in Vision Algorithms, 2010.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585

P. Koniusz and K. Mikolajczyk, Spatial Coordinate Coding to reduce histogram representations, Dominant Angle and Colour Pyramid Match, 2011 18th IEEE International Conference on Image Processing, 2011.
DOI : 10.1109/ICIP.2011.6116639

J. Sánchez, F. Perronnin, and T. E. De-campos, Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, 2012.
DOI : 10.1016/j.patrec.2012.07.019

L. D. Lathauwer, B. D. Moor, and J. Vandewalle, A Multilinear Singular Value Decomposition, SIAM Journal on Matrix Analysis and Applications, vol.21, issue.4, pp.1253-1278, 2000.
DOI : 10.1137/S0895479896305696

T. G. Kolda and B. W. Bader, Tensor Decompositions and Applications, SIAM Review, vol.51, issue.3, pp.455-500, 2009.
DOI : 10.1137/07070111X
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.130.782

J. Carreira, R. Caseiro, J. Batista, and C. Sminchisescu, Semantic Segmentation with Second-Order Pooling, 2012.
DOI : 10.1007/978-3-642-33786-4_32

X. Yu and Y. Zhang, A 2D histogram representation of images for pooling, Image Processing: Machine Vision Applications IV, 2011.
DOI : 10.1117/12.872257

M. Tahir, J. Kittler, K. Mikolajczyk, F. Yan, K. Van-de-sande et al., Visual category recognition using Spectral Regression and Kernel Discriminant Analysis, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, 2009.
DOI : 10.1109/ICCVW.2009.5457703

J. Bilmes, A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models, ICSI, 1997.

D. Picard and P. Gosselin, Improving image similarity with vectors of locally aggregated tensors, 2011 18th IEEE International Conference on Image Processing, 2011.
DOI : 10.1109/ICIP.2011.6116641
URL : https://hal.archives-ouvertes.fr/hal-00591993

P. Koniusz and K. Mikolajczyk, On a Quest for Image Descriptors Based on Unsupervised Segmentation Maps, pp.762-765, 2010.

M. Nilsback and A. Zisserman, Automated Flower Classification over a Large Number of Classes, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008.
DOI : 10.1109/ICVGIP.2008.47

X. Yuan and S. Yan, Visual classification with multi-task joint sparse representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539967

J. Yang, Y. Tian, L. Duan, T. Huang, and W. Gao, Group-Sensitive Multiple Kernel Learning for Object Recognition, pp.2838-2852, 2012.

F. Yan, K. Mikolajczyk, M. Barnard, H. Cai, and J. Kittler, Lp Norm Multiple Kernel Fisher Discriminant Analysis for Object and Image Categorisation, 2010.

S. Nowak, K. Nagel, and J. Liebetrau, The CLEF 2011 Photo Annotation and Concept-based Retrieval Tasks, 2011.

M. J. Huiskes and M. S. Lew, The MIR flickr retrieval evaluation, Proceedings of the 1st ACM international conference on Multimedia information retrieval, MIR '08, pp.39-43, 2008.
DOI : 10.1145/1460096.1460104

M. A. Tahir, F. Yan, M. Barnard, M. Awais, K. Mikolajczyk et al., The University of Surrey Visual Concept Detection System at ImageCLEF 2010: Working Notes, 2010.

A. Binder, W. Samek, and M. Kawanabe, The joint Submission of the TU Berlin and Fraunhofer FIRST (TUBFI) to the ImageCLEF 2011 Photo Annotation Task: Working Notes, 2011.

A. Hegerath, T. Deselaers, and H. Ney, Patch-based Object Recognition Using Discriminatively Trained Gaussian Mixtures, Proceedings of the British Machine Vision Conference 2006, pp.519-528, 2006.
DOI : 10.5244/C.20.54
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.214.7347

J. Yang, Z. Wang, Z. Lin, X. Shu, and T. Huang, Bilevel Sparse Coding for Coupled Feature Spaces, 2012.

S. Avila, N. Thome, M. Cord, E. Valle, and A. de A. Araújo, Pooling in image representation: The visual codeword point of view, Computer Vision and Image Understanding, vol.117, issue.5, 2012.
DOI : 10.1016/j.cviu.2012.09.007
URL : https://hal.archives-ouvertes.fr/hal-01172709

M. Awais, F. Yan, K. Mikolajczyk, and J. Kittler, Novel Fusion Methods for Pattern Recognition, 2011.
DOI : 10.1007/978-3-642-23780-5_19

B. Yao, X. Jiang, A. Khosla, A. L. Lin, L. J. Guibas et al., Human action recognition by learning bases of action attributes and parts, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126386

N. Kulkarni and B. Li, Discriminative affine sparse codes for image classification, CVPR 2011, pp.1609-1616, 2011.
DOI : 10.1109/CVPR.2011.5995701

C. Zhang, Q. Huang, J. Liu, Q. Tian, C. Liang et al., Image Classification Using Haar-like Transformation of Local Features with Coding Residuals, SP, 2012.

B. Yao, A. Khosla, and L. Fei-Fei, Combining randomization and discrimination for fine-grained image categorization, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995368

M. Everingham, L. Van-gool, C. K. Williams, J. Winn, and A. Zisserman, The PASCAL Visual Object Classes Challenge 2007-2012 Results, 2012.

L. Fei-Fei, R. Fergus, and P. Perona, Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories, CVPR Workshop on Generative-Model Based Vision, 2004.
DOI : 10.1016/j.cviu.2005.09.012

M. J. Huiskes, B. Thomee, and M. S. Lew, New trends and ideas in visual concept detection, Proceedings of the international conference on Multimedia information retrieval, MIR '10, pp.527-536, 2010.
DOI : 10.1145/1743384.1743475

J. Mairal, F. Bach, J. Ponce, and G. Sapiro, Online Learning for Matrix Factorization and Sparse Coding, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00408716

P. Koniusz, Novel Image Representations for Visual Categorisation with Bag-of-Words, 2013.