Weakly supervised learning of visual models and its application to content-based retrieval

Cordelia Schmid

doi:10.1023/B:VISI.0000004829.38247.b0

Article Dans Une Revue International Journal of Computer Vision Année : 2004

Weakly supervised learning of visual models and its application to content-based retrieval

(1)

Cordelia Schmid

Fonction : Auteur correspondant
PersonId : 831154

Connectez-vous pour contacter l'auteur

Learning and recognition in vision

Résumé

This paper presents a method for weakly supervised learning of visual models. The visual model is based on a two-layer image description: a set of "generic" descriptors and their distribution over neighbourhoods. "Generic" descriptors represent sets of similar rotational invariant feature vectors. Statistical spatial constraints describe the neighborhood structure and make our description more discriminant. The joint probability of the frequencies of "generic" descriptors over a neighbourhood is multi-modal and is represented by a set of "neighbourhood-frequency" clusters. Our image description is rotationally invariant, robust to model deformations and characterizes efficiently "appearance-based" visual structure. The selection of distinctive clusters determines model features (common to the positive and rare in the negative examples). Visual models are retrieved and localized using a probabilistic score. Experimental results for "textured" animals and faces show a very good performance for retrieval as well as localization.

Mots clés

visual model two-layer image description weakly supervised learning

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

schmid_ijcv2004.pdf (420.71 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

THOTH Team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00548553

Soumis le : lundi 20 décembre 2010-09:09:43

Dernière modification le : jeudi 4 avril 2024-21:36:21

Archivage à long terme le : lundi 21 mars 2011-03:17:07

Dates et versions

inria-00548553 , version 1 (20-12-2010)

Identifiants

HAL Id : inria-00548553 , version 1
DOI : 10.1023/B:VISI.0000004829.38247.b0

Citer

Cordelia Schmid. Weakly supervised learning of visual models and its application to content-based retrieval. International Journal of Computer Vision, 2004, Special Issue on Content-Based Image Retrieval, 56 (1), pp.7--16. ⟨10.1023/B:VISI.0000004829.38247.b0⟩. ⟨inria-00548553⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA IMAG CNRS INRIA INRIA2

109 Consultations

377 Téléchargements

Weakly supervised learning of visual models and its application to content-based retrieval

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager