Category level object segmentation by combining bag-of-words models with Dirichlet processes and random fields

Diane Larlus; Jakob Verbeek; Frédéric Jurie

doi:10.1007/s11263-009-0245-x

Journal article, International Journal of Computer Vision, Year: 2010

Diane Larlus

Role: Author

Learning and recognition in vision

Jakob Verbeek

Role: Author
PersonId: 10676
IdHAL: verbeek
ORCID: 0000-0003-1419-1816
IdRef: 180998463

Learning and recognition in vision

Frédéric Jurie

Role: Author
PersonId: 3233
IdHAL: frederic-jurie
ORCID: 0000-0002-2686-0020
IdRef: 080485022

Learning and recognition in vision

Equipe Image - Laboratoire GREYC - UMR6072

Abstract

This paper addresses the problem of accurately segmenting instances of object classes in images without any human interaction. Our model combines a bag-of-words recognition component with spatial regularization based on a random field and a Dirichlet process mixture. Bag-of-words models successfully predict the presence of an object within an image; however, they cannot accurately locate object boundaries. Random fields take into account the spatial layout of images and provide local spatial regularization. Yet, as they use local coupling between image labels, they fail to capture the larger-scale structures needed for object recognition. These components are combined with a Dirichlet process mixture, which models images as a composition of regions, each representing a single object instance. Gibbs sampling is used for parameter estimation and object segmentation. Our model successfully segments object category instances, despite cluttered backgrounds and large variations in appearance and viewpoint. The strengths and limitations of our model are shown through extensive experimental evaluations. First, we evaluate the results of two methods for building visual vocabularies. Second, we show how to combine strong labeling (segmented images) with weak labeling (images annotated with bounding boxes), in order to limit the labeling effort needed to learn the model. Third, we study the effect of different initializations. We present results on four image databases, including the challenging PASCAL VOC 2007 data set, on which we obtain state-of-the-art results.

Keywords

Object recognition; Segmentation; Random fields

Domains

Machine Learning [cs.LG]

Main file

segmentation.pdf (2.12 MB)

LVJ.png (112.14 KB)

Origin: Files produced by the author(s)

Format: Figure, Image

Contributor: Jakob Verbeek

https://inria.hal.science/inria-00439303

Submitted on: Wednesday, April 27, 2011, 13:30:48

Last modified on: Saturday, April 27, 2024, 03:09:42

Long-term archiving on: Thursday, November 8, 2012, 17:30:19

Dates and versions

inria-00439303, version 1 (25-01-2011)

inria-00439303, version 2 (27-04-2011)

Identifiers

HAL Id: inria-00439303, version 2
DOI: 10.1007/s11263-009-0245-x

Cite

Diane Larlus, Jakob Verbeek, Frédéric Jurie. Category level object segmentation by combining bag-of-words models with Dirichlet processes and random fields. International Journal of Computer Vision, 2010, 88 (2), pp.238-253. ⟨10.1007/s11263-009-0245-x⟩. ⟨inria-00439303v2⟩

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA INSMI LJK LJK_GI LJK_GI_LEAR GREYC GREYC-IMAGE COMUE-NORMANDIE INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ENSICAEN UNICAEN UR1-MATH-NUM

795 views

651 downloads
