Multiple instance metric learning from automatically labeled bags of faces

Matthieu Guillaumin; Jakob Verbeek; Cordelia Schmid

doi:10.1007/978-3-642-15549-9_46

Communication Dans Un Congrès Année : 2010

Multiple instance metric learning from automatically labeled bags of faces

(1) , (1) , (1)

Matthieu Guillaumin

Fonction : Auteur
PersonId : 879978

Learning and recognition in vision

Jakob Verbeek

Fonction : Auteur
PersonId : 10676
IdHAL : verbeek
ORCID : 0000-0003-1419-1816
IdRef : 180998463

Learning and recognition in vision

Cordelia Schmid

Fonction : Auteur
PersonId : 831154

Learning and recognition in vision

Résumé

Metric learning aims at finding a distance that approximates a task-specific notion of semantic similarity. Typically, a Mahalanobis distance is learned from pairs of data labeled as being semantically similar or not. In this paper, we learn such metrics in a weakly supervised setting where "bags" of instances are labeled with "bags" of labels. We formulate the problem as a multiple instance learning (MIL) problem over pairs of bags. If two bags share at least one label, we label the pair positive, and negative otherwise. We propose to learn a metric using those labeled pairs of bags, leading to MildML, for multiple instance logistic discriminant metric learning. MildML iterates between updates of the metric and selection of putative positive pairs of examples from positive pairs of bags. To evaluate our approach, we introduce a large and challenging data set, Labeled Yahoo! News, which we have manually annotated and contains 31147 detected faces of 5873 different people in 20071 images. We group the faces detected in an image into a bag, and group the names detected in the caption into a corresponding set of labels. When the labels come from manual annotation, we find that MildML using the bag-level annotation performs as well as fully supervised metric learning using instance-level annotation. We also consider performance in the case of automatically extracted labels for the bags, where some of the bag labels do not correspond to any example in the bag. In this case MildML works substantially better than relying on noisy instance-level annotations derived from the bag-level annotation by resolving face-name associations in images with their captions.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

GVS10a.pdf (274.37 Ko)

GVS10a2.png (134.1 Ko)

poster.pdf (538.26 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Figure, Image

Format : Autre

THOTH Team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00548639

Soumis le : lundi 20 décembre 2010-10:23:21

Dernière modification le : jeudi 4 avril 2024-21:15:46

Archivage à long terme le : lundi 5 novembre 2012-14:36:57

Dates et versions

inria-00548639 , version 1 (20-12-2010)

Identifiants

HAL Id : inria-00548639 , version 1
DOI : 10.1007/978-3-642-15549-9_46

Citer

Matthieu Guillaumin, Jakob Verbeek, Cordelia Schmid. Multiple instance metric learning from automatically labeled bags of faces. ECCV 2010 - European Conference on Computer Vision, Sep 2010, Heraklion, Greece. pp.634-647, ⟨10.1007/978-3-642-15549-9_46⟩. ⟨inria-00548639⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

985 Consultations

3663 Téléchargements

Multiple instance metric learning from automatically labeled bags of faces

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager