Towards a better integration of written names for unsupervised speakers identification in videos

Existing methods for unsupervised identification of speakers in TV broadcast usually rely on the output of a speaker diarization module and try to name each cluster using names provided by another source of information: we call it "late naming". Hence, written names extracted from title blocks tend to lead to high-precision identification, although they cannot correct errors made during the clustering step. In this paper, we extend our previous "late naming" approach in two ways: "integrated naming" and "early naming". While "late naming" relies on a speaker diarization module optimized for speaker diarization, "integrated naming" jointly optimizes speaker diarization and name propagation in terms of identification errors. "Early naming" modifies the speaker diarization module by adding constraints that prevent two clusters with different written names from being merged. While "integrated naming" yields identification performance similar to that of "late naming" (with better precision), "early naming" improves over this baseline in terms of both identification error rate and stability of the clustering stopping criterion.
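
The "early naming" constraint described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the average-linkage rule, the distance dictionary, the fixed stopping threshold, and the names `early_naming_clustering`, `distances`, `names` and `threshold` are all assumptions made for illustration. The only idea taken from the abstract is that two clusters carrying different written names are never merged.

```python
from itertools import combinations


def early_naming_clustering(distances, names, threshold):
    """Agglomerative clustering with a name-based cannot-merge constraint.

    distances: dict mapping frozenset({i, j}) to a distance between segments
               (hypothetical input format).
    names:     dict mapping segment index to a written name, or None.
    threshold: merging stops once the closest allowed pair exceeds this value
               (assumed stopping rule).
    """
    clusters = {i: {i} for i in names}   # cluster id -> segment indices
    cluster_name = dict(names)           # cluster id -> written name or None

    def average_linkage(a, b):
        # Mean pairwise distance between the segments of two clusters.
        pairs = [(i, j) for i in clusters[a] for j in clusters[b]]
        return sum(distances[frozenset(p)] for p in pairs) / len(pairs)

    while True:
        best = None
        for a, b in combinations(sorted(clusters), 2):
            name_a, name_b = cluster_name[a], cluster_name[b]
            # Constraint: never merge clusters with different written names.
            if name_a and name_b and name_a != name_b:
                continue
            d = average_linkage(a, b)
            if best is None or d < best[0]:
                best = (d, a, b)
        if best is None or best[0] > threshold:
            break
        _, a, b = best
        clusters[a] |= clusters.pop(b)
        # The merged cluster inherits whichever name is known.
        cluster_name[a] = cluster_name[a] or cluster_name.pop(b)
        cluster_name.pop(b, None)

    return [(sorted(members), cluster_name[k])
            for k, members in sorted(clusters.items())]


# Toy data: segments 0 and 1 are acoustically close but carry different names.
toy_names = {0: "Alice", 1: "Bob", 2: None, 3: None}
toy_distances = {
    frozenset({0, 1}): 0.1, frozenset({0, 2}): 0.2, frozenset({0, 3}): 0.9,
    frozenset({1, 2}): 0.8, frozenset({1, 3}): 0.3, frozenset({2, 3}): 0.9,
}
result = early_naming_clustering(toy_distances, toy_names, threshold=0.5)
print(result)
```

In this toy example, segments 0 and 1 are the closest pair, so an unconstrained clustering would merge them first; the name constraint blocks that merge, and each unnamed segment instead joins the named cluster it is closest to.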

Domains

Information retrieval [cs.IR]

Main file

POIGNANT--SLAM--2013.pdf (659.62 KB)

Origin: Files produced by the author(s)

Contributor: Marie-Christine Fauvet

https://inria.hal.science/hal-00953089

Submitted on: Monday, March 3, 2014, 16:29:47

Last modified on: Friday, April 5, 2024, 03:24:14

Long-term archiving on: Saturday, May 31, 2014, 10:45:49

Dates and versions

hal-00953089, version 1 (03-03-2014)

Identifiers

HAL Id: hal-00953089, version 1

Cite

Johann Poignant, Hervé Bredin, Laurent Besacier, Georges Quénot, Claude Barras. Towards a better integration of written names for unsupervised speakers identification in videos. First Workshop on Speech, Language and Audio in Multimedia, SLAM, 2013, Marseille, France. ⟨hal-00953089⟩

Collections

UGA CNRS INRIA LIG LIMSI LIG_TDCGE LIG_TDCGE_GETALP LIG_TDCGE_MRIM SORBONNE-UNIVERSITE POLYTECH-GRENOBLE LISN GS-SPORT-HUMAN-MOVEMENT LIG_SIDCH
