Region-Based Image Classification with a Latent SVM Model

Oksana Yakhnenko (1), Jakob Verbeek (1), Cordelia Schmid (1)
(1) LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract: Image classification is a challenging problem due to intra-class appearance variation, background clutter, occlusion, and photometric variability. Current state-of-the-art methods do not explicitly handle background clutter, but rely on global image representations, such as bag-of-words (BoW) models. Multiple-instance learning has been used to deal with clutter explicitly, classifying an image as positive as soon as at least one image region is classified as positive. In this paper, we propose a more robust latent-SVM model that, unlike multiple-instance learning, does not rely on a single image region to trigger a positive image classification. Rather, our model scores an image using all of its regions, and associates with each region a latent variable that indicates whether the region represents the object of interest or its background. Background and foreground regions are each scored by a different appearance model, and an additional term in the score function ensures that neighboring regions tend to take the same background/foreground label. We learn the parameters of our latent SVM model using an iterative procedure that alternates between inferring the latent variables and updating the parameters. We compare the performance of our approach on the PASCAL VOC'07 dataset to that of SVMs trained on global BoW representations, and to a multiple-instance SVM trained on BoW representations of image regions. We show that our approach outperforms multiple-instance learning by a large margin on all classes, and outperforms global BoW models in 17 out of the 20 classes.
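The score function described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `w_fg`, `w_bg`, `gamma`, and `edges` are hypothetical, the pairwise term is assumed to be a constant reward for each pair of neighboring regions that share a label, and the image score is assumed to be the maximum over all labelings, found here by brute force (feasible only for a handful of regions). A multiple-instance score is included for contrast.

```python
from itertools import product

import numpy as np


def labeling_score(X, edges, w_fg, w_bg, gamma, z):
    """Score of one foreground (1) / background (0) labeling z of the regions."""
    z = np.asarray(z)
    # Unary terms: each region is scored by the appearance model of its label.
    unary = np.where(z == 1, X @ w_fg, X @ w_bg).sum()
    # Pairwise term (assumed form): reward neighbors that share a label.
    pairwise = gamma * sum(z[i] == z[j] for i, j in edges)
    return float(unary + pairwise)


def image_score(X, edges, w_fg, w_bg, gamma):
    """Maximize the score over all 2^n labelings (brute force, small n only)."""
    n = X.shape[0]
    best = max(
        product((0, 1), repeat=n),
        key=lambda z: labeling_score(X, edges, w_fg, w_bg, gamma, z),
    )
    return labeling_score(X, edges, w_fg, w_bg, gamma, best), best


def mil_score(X, w):
    """Multiple-instance baseline: the image score is its best single region."""
    return float((X @ w).max())


# Toy data (made up for illustration): three regions on a chain, 2-D features.
X = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]])
edges = [(0, 1), (1, 2)]
w_fg = np.array([2.0, -1.0])
w_bg = np.array([0.0, 1.0])

weak = image_score(X, edges, w_fg, w_bg, gamma=0.5)
strong = image_score(X, edges, w_fg, w_bg, gamma=3.0)
```

In this toy setting, a weak smoothness weight lets the middle region take the background label, while a strong one pulls all three regions to the same label. The multiple-instance score depends on one region only, whereas the latent score sums contributions from every region.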
Document type: Report
[Research Report] RR-7665, INRIA. 2011

Cited literature: 28 references


https://hal.inria.fr/inria-00605344
Contributor: Jakob Verbeek
Submitted on: Friday, 1 July 2011 - 13:54:51
Last modified on: Wednesday, 11 April 2018 - 01:58:20
Document(s) archived on: Monday, 12 November 2012 - 09:52:19

Files

RR-7665.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: inria-00605344, version 1

Citation

Oksana Yakhnenko, Jakob Verbeek, Cordelia Schmid. Region-Based Image Classification with a Latent SVM Model. [Research Report] RR-7665, INRIA. 2011. 〈inria-00605344〉
