HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Learning discriminative spatial representation for image classification

Gaurav Sharma 1, 2 Frédéric Jurie 1, 3
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology
3 Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image et Instrumentation de Caen
Abstract : Spatial Pyramid Representation (SPR) [7] introduces spatial layout information to the orderless bag-of-features (BoF) representation. SPR has become the standard and has been shown to perform competitively against more complex methods for incorporating spatial layout. In SPR the image is divided into regular grids. However, the grids are taken as uniform spatial partitions without any theoretical motivation. In this paper, we address this issue and propose to learn the spatial partitioning with the BoF representation. We define a space of grids where each grid is obtained by a series of recursive axis aligned splits of cells. We cast the classification problem in a maximum margin formulation with the optimization being over the weight vector and the spatial grid. In addition to experiments on two challenging public datasets (Scene-15 and Pascal VOC 2007) showing that the learnt grids consistently perform better than the SPR while being much smaller in vector length, we also introduce a new dataset of human attributes and show that the current method is well suited to the recognition of spatially localized human attributes.
Document type :
Conference papers
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download

Contributor : Thoth Team Connect in order to contact the contributor
Submitted on : Saturday, August 4, 2012 - 2:58:40 PM
Last modification on : Thursday, January 20, 2022 - 5:28:02 PM
Long-term archiving on: : Friday, December 16, 2016 - 5:09:07 AM


Files produced by the author(s)



Gaurav Sharma, Frédéric Jurie. Learning discriminative spatial representation for image classification. BMVC 2011 - British Machine Vision Conference, Aug 2011, Dundee, United Kingdom. pp.1-11, ⟨10.5244/C.25.6⟩. ⟨hal-00722820⟩



Record views


Files downloads