Image categorization using Fisher kernels of non-iid image models

Ramazan Gokberk Cinbis; Jakob Verbeek; Cordelia Schmid

doi:10.1109/CVPR.2012.6247926

Conference Papers Year : 2012

Image categorization using Fisher kernels of non-iid image models

(1) , (1) , (1)

Ramazan Gokberk Cinbis

Function : Author
PersonId : 933132

Learning and recognition in vision

Jakob Verbeek

Function : Author
PersonId : 10676
IdHAL : verbeek
ORCID : 0000-0003-1419-1816
IdRef : 180998463

Learning and recognition in vision

Cordelia Schmid

Function : Author
PersonId : 831154

Learning and recognition in vision

Abstract

The bag-of-words (BoW) model treats images as an unordered set of local regions and represents them by visual word histograms. Implicitly, regions are assumed to be identically and independently distributed (iid), which is a poor assumption from a modeling perspective. We introduce non-iid models by treating the parameters of BoW models as latent variables which are integrated out, rendering all local regions dependent. Using the Fisher kernel we encode an image by the gradient of the data log-likelihood w.r.t. hyper-parameters that control priors on the model parameters. Our representation naturally involves discounting transformations similar to taking square-roots, providing an explanation of why such transformations have proven successful. Using variational inference we extend the basic model to include Gaussian mixtures over local descriptors, and latent topic models to capture the co-occurrence structure of visual words, both improving performance. Our models yield state-of-the-art categorization performance using linear classifiers; without using non-linear transformations such as taking square-roots of features, or using (approximate) explicit embeddings of non-linear kernels.

Domains

Computer Vision and Pattern Recognition [cs.CV]

Fichier principal

paper_final.pdf (560.01 Ko)

fig1.jpg (29.81 Ko)

Origin : Files produced by the author(s)

Format : Figure, Image

THOTH Team : Connect in order to contact the contributor

https://inria.hal.science/hal-00685943

Submitted on : Friday, April 6, 2012-2:08:50 PM

Last modification on : Thursday, April 4, 2024-9:06:51 PM

Long-term archiving on: Saturday, July 7, 2012-2:35:22 AM

Dates and versions

hal-00685943 , version 1 (06-04-2012)

Identifiers

HAL Id : hal-00685943 , version 1
DOI : 10.1109/CVPR.2012.6247926

Cite

Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid. Image categorization using Fisher kernels of non-iid image models. CVPR 2012 - IEEE Conference on Computer Vision & Pattern Recognition, Jun 2012, Providence, United States. pp.2184-2191, ⟨10.1109/CVPR.2012.6247926⟩. ⟨hal-00685943⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

988 View

1967 Download

Image categorization using Fisher kernels of non-iid image models

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share