TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation

Matthieu Guillaumin; Thomas Mensink; Jakob Verbeek; Cordelia Schmid

doi:10.1109/ICCV.2009.5459266

Conference paper, Year: 2009

Matthieu Guillaumin

Role: Author

Learning and recognition in vision

Thomas Mensink

Role: Author
PersonId: 853681

Learning and recognition in vision

Jakob Verbeek

Role: Author
PersonId: 10676
IdHAL: verbeek
ORCID: 0000-0003-1419-1816
IdRef: 180998463

Learning and recognition in vision

Cordelia Schmid

Role: Author
PersonId: 831154

Learning and recognition in vision

Abstract

Image auto-annotation is an important open problem in computer vision. For this task we propose TagProp, a discriminatively trained nearest neighbor model. Tags of test images are predicted using a weighted nearest-neighbor model to exploit labeled training images. Neighbor weights are based on neighbor rank or distance. TagProp allows the integration of metric learning by directly maximizing the log-likelihood of the tag predictions in the training set. In this manner, we can optimally combine a collection of image similarity metrics that cover different aspects of image content, such as local shape descriptors, or global color histograms. We also introduce a word specific sigmoidal modulation of the weighted neighbor tag predictions to boost the recall of rare words. We investigate the performance of different variants of our model and compare to existing work. We present experimental results for three challenging data sets. On all three, TagProp makes a marked improvement as compared to the current state-of-the-art.
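The weighted nearest-neighbor prediction described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract gives no formulas, so the softmax over negative combined distances, the clipping constant `eps`, and the names `neighbor_weights`, `predict_tag`, `log_likelihood`, `theta`, `alpha`, and `beta` are all assumptions chosen for the example. Only the distance-based weighting is shown; the rank-based variant is omitted.

```python
import math


def neighbor_weights(base_distances, theta):
    """Turn several base distances per neighbor into normalized weights.

    base_distances: list of K lists, one per neighbor, each holding M
        distances (one per base metric, e.g. color or shape).
    theta: list of M non-negative combination weights (hypothetical
        parameters; in a learned model these would be fitted by
        maximizing the training log-likelihood).
    """
    combined = [sum(t * d for t, d in zip(theta, dists))
                for dists in base_distances]
    shift = min(combined)  # subtract the minimum for numerical stability
    unnorm = [math.exp(-(c - shift)) for c in combined]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def predict_tag(weights, neighbor_has_tag, alpha=None, beta=None, eps=1e-5):
    """Probability that the query image carries one particular tag.

    Without alpha/beta: a weighted vote in which each neighbor contributes
    1 - eps if it has the tag and eps otherwise (assumed smoothing).
    With alpha/beta: a word-specific sigmoid applied to the weighted vote,
    which can raise the predicted probability of rare words.
    """
    if alpha is None or beta is None:
        return sum(w * ((1.0 - eps) if has else eps)
                   for w, has in zip(weights, neighbor_has_tag))
    vote = sum(w for w, has in zip(weights, neighbor_has_tag) if has)
    return 1.0 / (1.0 + math.exp(-(alpha * vote + beta)))


def log_likelihood(predictions, labels):
    """Bernoulli log-likelihood of binary tag labels given probabilities."""
    return sum(math.log(p) if y else math.log(1.0 - p)
               for p, y in zip(predictions, labels))


# Usage: three neighbors, two base metrics, one tag.
dists = [[0.2, 0.5], [0.9, 0.1], [1.5, 1.2]]
w = neighbor_weights(dists, theta=[1.0, 2.0])
p_plain = predict_tag(w, [False, False, True])
p_sigmoid = predict_tag(w, [False, False, True], alpha=10.0, beta=0.0)
```

In this sketch, a tag held only by a distant neighbor receives a small plain vote, while the sigmoid variant with suitable `alpha` and `beta` assigns it a higher probability. Fitting `theta`, `alpha`, and `beta` would amount to maximizing `log_likelihood` over the training tags, for example by gradient ascent.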

Keywords

image processing; learning (artificial intelligence)

Domains

Machine Learning [cs.LG]

Main file

GMVS09.pdf (239.37 KB)

GMVS.png (174.24 KB)

SupplMat.pdf (6.24 MB)

slides.pdf (2.15 MB)

Origin: Files produced by the author(s)

Format: Figure, Image

Format: Other

Contributor: THOTH Team

https://inria.hal.science/inria-00439276

Submitted on: Thursday, December 2, 2010, 13:50:06

Last modified on: Thursday, April 4, 2024, 21:06:51

Long-term archiving on: Monday, November 5, 2012, 11:05:29

Dates and versions

inria-00439276, version 1 (02-12-2010)

Identifiers

HAL Id: inria-00439276, version 1
DOI: 10.1109/ICCV.2009.5459266

Cite

Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, Cordelia Schmid. TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation. ICCV 2009 - 12th International Conference on Computer Vision, Sep 2009, Kyoto, Japan. pp.309-316, ⟨10.1109/ICCV.2009.5459266⟩. ⟨inria-00439276⟩

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

659 views

3095 downloads
