Analysis of I-Vector framework for Speaker Identification in TV-shows

Corinne Fredouille; Delphine Charlet

Communication Dans Un Congrès Année : 2014

Analysis of I-Vector framework for Speaker Identification in TV-shows

(1) , (2)

1
2

Corinne Fredouille

Fonction : Auteur
PersonId : 173870
IdHAL : corinne-fredouille
ORCID : 0000-0002-0413-8950
IdRef : 079420516

Laboratoire Informatique d'Avignon

Delphine Charlet

Fonction : Auteur
PersonId : 1005321

Orange Labs [Lannion]

Résumé

Inspired from the Joint Factor Analysis, the I-vector-based analysis has become the most popular and state-of-the-art framework for the speaker verification task. Mainly applied within the NIST/SRE evaluation campaigns, many studies have been proposed to improve more and more performance of speaker verification systems. Nevertheless, while the i-vector framework has been used in other speech processing fields like language recognition, a very few studies have been reported for the speaker identification task on TV shows. This work was done in the REPERE challenge context, focused on the people recognition task in multimodal conditions (audio, video, text) from TV show corpora. Moreover, the challenge participants are invited for providing systems for monomodal tasks, like speaker identification. The application of the i-vector framework is investi-gatedthrough different points of views: (1) some of the i-vector based approaches are compared, (2) a specific i-vector extraction protocol is proposed in order to deal with widely varying amounts of training data among speaker population, (3) the joint use of both speaker diarization and identification is finally analyzed. Based on a 533 speaker dictionary, this joint system wins the monomodal speaker identification task of the 2014 REPERE challenge.

Mots clés

speaker identification i-vector REPERE challenge TV shows

Domaines

Informatique et langage [cs.CL]

Fichier principal

i14_0071.pdf (155.92 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Corinne Fredouille : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02102810

Soumis le : vendredi 19 avril 2019-11:59:44

Dernière modification le : dimanche 29 novembre 2020-17:02:03

Dates et versions

hal-02102810 , version 1 (19-04-2019)

Identifiants

HAL Id : hal-02102810 , version 1

Citer

Corinne Fredouille, Delphine Charlet. Analysis of I-Vector framework for Speaker Identification in TV-shows. Interspeech'2014, Sep 2014, Singapour, Singapore. ⟨hal-02102810⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON LIA

67 Consultations

162 Téléchargements

Analysis of I-Vector framework for Speaker Identification in TV-shows

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager