Category level object segmentation by combining bag-of-words models with Dirichlet processes and random fields

Diane Larlus (1), Jakob Verbeek (1), Frédéric Jurie (2)
(1) LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
(2) Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : This paper addresses the problem of accurately segmenting instances of object classes in images without any human interaction. Our model combines a bag-of-words recognition component with spatial regularization based on a random field and a Dirichlet process mixture. Bag-of-words models successfully predict the presence of an object within an image; however, they cannot accurately locate object boundaries. Random fields take into account the spatial layout of images and provide local spatial regularization. Yet, because they rely on local coupling between image labels, they fail to capture the larger-scale structures needed for object recognition. These components are combined with a Dirichlet process mixture, which models images as a composition of regions, each representing a single object instance. Gibbs sampling is used for parameter estimation and object segmentation. Our model successfully segments object category instances, despite cluttered backgrounds and large variations in appearance and viewpoint. The strengths and limitations of our model are shown through extensive experimental evaluations. First, we evaluate two methods for building visual vocabularies. Second, we show how to combine strong labeling (segmented images) with weak labeling (images annotated with bounding boxes), in order to limit the labeling effort needed to learn the model. Third, we study the effect of different initializations. We present results on four image databases, including the challenging PASCAL VOC 2007 data set, on which we obtain state-of-the-art results.
Document type :
Journal articles

https://hal.inria.fr/inria-00439303
Contributor : Jakob Verbeek
Submitted on : Tuesday, January 25, 2011 - 9:52:33 AM
Last modification on : Tuesday, February 5, 2019 - 12:12:43 PM
Long-term archiving on : Tuesday, April 26, 2011 - 2:35:28 AM

File

verbeek10ijcv.pdf
Files produced by the author(s)

Citation

Diane Larlus, Jakob Verbeek, Frédéric Jurie. Category level object segmentation by combining bag-of-words models with Dirichlet processes and random fields. International Journal of Computer Vision, Springer Verlag, 2010, 88 (2), pp. 238-253. ⟨10.1007/s11263-009-0245-x⟩. ⟨inria-00439303v1⟩
