Fisher Vectors for Fine-Grained Visual Categorization - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Fisher Vectors for Fine-Grained Visual Categorization

Résumé

The bag-of-visual-words (BOV) is certainly the most popular image representation to date and it has been shown to yield good results in various problems including Fine-Grained Visual Categorization (FGVC) [3, 4]. Our contribution is to show that the Fisher Vector (FV) - which describes an image by its deviation from an "average" model - is an excellent alternative to the BOV for the FGVC problem. In this extended abstract we first provide a brief introduction to the FV. We then present theoretical as well as practical motivations for using the FV for FGVC. We finally provide experimental results on four ImageNet subsets: fungus, ungulate, vehicle and ImageNet10K. Compared to [4] which uses spatial pyramid (SP) BOV representations, we report significantly higher classification accuracies. For instance, on ImageNet10K we report 16.7% vs 6.4% top-1 accuracy (a 160% relative improvement).
Fichier principal
Vignette du fichier
fgvc11.pdf (38.76 Ko) Télécharger le fichier
fgvc11-poster.pdf (2.66 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Format : Autre

Dates et versions

hal-00817681 , version 1 (25-04-2013)

Identifiants

  • HAL Id : hal-00817681 , version 1

Citer

Jorge Sánchez, Florent Perronnin, Zeynep Akata. Fisher Vectors for Fine-Grained Visual Categorization. FGVC Workshop in IEEE Computer Vision and Pattern Recognition (CVPR), IEEE, Jun 2011, Colorado Springs, United States. ⟨hal-00817681⟩
561 Consultations
521 Téléchargements

Partager

Gmail Facebook X LinkedIn More