Automatic Face Naming with Caption-based Supervision - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Automatic Face Naming with Caption-based Supervision

Thomas Mensink
  • Fonction : Auteur
  • PersonId : 853681
Jakob Verbeek
Cordelia Schmid
  • Fonction : Auteur
  • PersonId : 831154

Résumé

We consider two scenarios of naming people in databases of news photos with captions: (i) finding faces of a single person, and (ii) assigning names to all faces. We combine an initial text-based step, that restricts the name assigned to a face to the set of names appearing in the caption, with a second step that analyzes visual features of faces. By searching for groups of highly similar faces that can be associated with a name, the results of purely text-based search can be greatly ameliorated. We improve a recent graph-based approach, in which nodes correspond to faces and edges connect highly similar faces. We introduce constraints when optimizing the objective function, and propose improvements in the low-level methods used to construct the graphs. Furthermore, we generalize the graph-based approach to face naming in the full data set. In this multi-person naming case the optimization quickly becomes computationally demanding, and we present an important speed-up using graph-flows to compute the optimal name assignments in documents. Generative models have previously been proposed to solve the multi-person naming task. We compare the generative and graph-based methods in both scenarios, and find significantly better performance using the graph-based methods in both cases.
Fichier principal
Vignette du fichier
GMVS08.pdf (781.35 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

inria-00321048 , version 1 (25-01-2011)
inria-00321048 , version 2 (11-04-2011)

Identifiants

Citer

Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, Cordelia Schmid. Automatic Face Naming with Caption-based Supervision. IEEE Conference on Computer Vision & Pattern Recognition (CPRV '08), Jun 2008, Anchorage, United States. pp.1 - 8, ⟨10.1109/CVPR.2008.4587603⟩. ⟨inria-00321048v1⟩
372 Consultations
1938 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More