Region-Based Image Classification with a Latent SVM Model

Oksana Yakhnenko 1, Jakob Verbeek 1, Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Image classification is a challenging problem due to intra-class appearance variation, background clutter, occlusion, and photometric variability. Current state-of-the-art methods do not explicitly handle background clutter, but rely on global image representations, such as bag-of-words (BoW) models. Multiple-instance learning has been used to explicitly deal with clutter, classifying an image positively as soon as at least one image region is classified positively. In this paper, we propose a more robust latent-SVM model that, unlike multiple-instance learning, does not rely on a single image region to trigger a positive image classification. Rather, our model scores an image using all regions, and associates with each region a latent variable that indicates whether the region represents the object of interest or its background. Background and foreground regions are each scored by a different appearance model, and an additional term in the score function ensures that neighboring regions tend to take the same background/foreground label. We learn the parameters of our latent SVM model using an iterative procedure that alternates between inferring the latent variables and updating the parameters. We compare the performance of our approach on the PASCAL VOC'07 dataset to that of SVMs trained on global BoW representations, and to a multiple-instance SVM trained on BoW representations of image regions. We show that our approach outperforms multiple-instance learning by a large margin on all classes, and outperforms global BoW models in 17 out of the 20 classes.
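The scoring idea described in the abstract can be illustrated with a minimal sketch. This is not the report's implementation: the linear appearance models `w_fg` and `w_bg`, the agreement weight `gamma`, the edge list `edges`, and the exhaustive search in `infer_latent` are all assumptions chosen for illustration, and the exact form of the score function in the report may differ. The sketch contrasts a score that uses every region with a multiple-instance score that depends on a single best region.

```python
import itertools
import numpy as np

def region_score(X, z, w_fg, w_bg, edges, gamma):
    """Image score for one labeling z (1 = foreground, 0 = background).

    Assumed form: each region is scored by the appearance model matching
    its label, plus gamma for every pair of neighbors sharing a label.
    """
    z = np.asarray(z)
    unary = np.where(z == 1, X @ w_fg, X @ w_bg).sum()
    pairwise = gamma * sum(int(z[i] == z[j]) for i, j in edges)
    return float(unary + pairwise)

def infer_latent(X, w_fg, w_bg, edges, gamma):
    """Pick the best labeling by brute force (only viable for few regions)."""
    best_z, best_s = None, -np.inf
    for z in itertools.product((0, 1), repeat=len(X)):
        s = region_score(X, z, w_fg, w_bg, edges, gamma)
        if s > best_s:
            best_z, best_s = z, s
    return best_z, best_s

def mil_score(X, w):
    """Multiple-instance baseline: the image score is its best single region."""
    return float((X @ w).max())

# Toy image: three regions in a chain, two-dimensional region features.
X = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
w_fg = np.array([1.0, -1.0])
w_bg = np.array([-1.0, 1.0])
edges = [(0, 1), (1, 2)]

print(infer_latent(X, w_fg, w_bg, edges, gamma=0.0))  # ((1, 0, 1), 3.0)
print(infer_latent(X, w_fg, w_bg, edges, gamma=3.0))  # ((1, 1, 1), 7.0)
print(mil_score(X, w_fg))                             # 1.0
```

With `gamma=0.0` each region takes whichever label scores higher on its own; raising `gamma` makes the middle region adopt the label of its neighbors. In a training loop following the alternating procedure the abstract mentions, a step like `infer_latent` would alternate with an update of the model parameters; that update is not sketched here.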
Document type :
Reports

https://hal.inria.fr/inria-00605344
Contributor : Jakob Verbeek
Submitted on : Friday, July 1, 2011 - 1:54:51 PM
Last modification on : Monday, December 17, 2018 - 11:22:02 AM
Long-term archiving on : Monday, November 12, 2012 - 9:52:19 AM

Files

RR-7665.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00605344, version 1

Citation

Oksana Yakhnenko, Jakob Verbeek, Cordelia Schmid. Region-Based Image Classification with a Latent SVM Model. [Research Report] RR-7665, INRIA. 2011. ⟨inria-00605344⟩
