Category level object segmentation by combining bag-of-words models and Markov Random Fields

Diane Larlus; Jakob Verbeek; Frédéric Jurie

Rapport (Rapport De Recherche) Année : 2008

Category level object segmentation by combining bag-of-words models and Markov Random Fields

(1) , (1, 2) , (3)

1
2
3

Diane Larlus

Fonction : Auteur

Learning and recognition in vision

Jakob Verbeek

Fonction : Auteur
PersonId : 10676
IdHAL : verbeek
ORCID : 0000-0003-1419-1816
IdRef : 180998463

Learning and recognition in vision

Instituut voor Informatica

Frédéric Jurie

Fonction : Auteur
PersonId : 3233
IdHAL : frederic-jurie
ORCID : 0000-0002-2686-0020
IdRef : 080485022

Equipe Image - Laboratoire GREYC - UMR6072

Résumé

This paper presents an approach to segment unseen objects of known categories. At the heart of the approach lies a probabilistic model of images which captures local appearance of objects through a bag-of-words representation. Bag-of-words models have been very successful for image categorization; however, as they model objects as loose collections of small image patches, they can not accurately predict object boundaries. On the other hand, Markov Random Fields (MRFs), which are often used in many low-level application for general purpose image segmentation, do incorporate the spatial layout of images. Yet, as they are usually based on very local image evidence they fail to capture larger scale structures needed to recognize object categories under large appearance variations. The main contribution of this article is to combine the advantages of both approaches into a single probabilistic model. First, a mechanism based on a bag-of-words representation produces object recognition and localization at a rough spatial resolution. Second, a MRF component enforces precise object boundaries, guided by local image cues (color, texture, and edges) and by long-distance dependencies. Gibbs sampling is used to infer the model parameters and the object segmentation. The proposed method successfully segments object categories, despite highly varying appearances, cluttered backgrounds and large viewpoint changes. Through a series of experiments, we emphasize the strength as well as the limitation of our model. First, we evaluate the results of several strategies for building the visual vocabulary. Second, we show how it is possible to combine strong labeling (segmented images) with weak labeling (images annotated with bounding boxes), in order to limit the amount of training data needed to learn the model. Third, we study the influence of the initialization on the model estimation. Last, we present extensive experiments on four different image databases, including the challenging Pascal VOC 2007 dataset on which we obtain state-of-the art results.

Domaines

Apprentissage [cs.LG]

Fichier principal

verbeek08tr.pdf (13.02 Mo)

publicationPage.png (185.56 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Figure, Image

Jakob Verbeek : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00333121

Soumis le : lundi 11 avril 2011-14:19:07

Dernière modification le : jeudi 4 avril 2024-21:05:40

Archivage à long terme le : samedi 3 décembre 2016-19:10:09

Dates et versions

inria-00333121 , version 1 (22-10-2008)

inria-00333121 , version 2 (11-04-2011)

Identifiants

HAL Id : inria-00333121 , version 2

Citer

Diane Larlus, Jakob Verbeek, Frédéric Jurie. Category level object segmentation by combining bag-of-words models and Markov Random Fields. [Research Report] RR-6668, INRIA. 2008, pp.31. ⟨inria-00333121v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA INRIA-RRRT LJK LJK_GI LJK_GI_LEAR GREYC GREYC-IMAGE COMUE-NORMANDIE INRIA2 LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ENSICAEN UNICAEN UR1-MATH-NUM

703 Consultations

726 Téléchargements

Category level object segmentation by combining bag-of-words models and Markov Random Fields

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager