Large Margin GMM for discriminative speaker verifi cation

Reda Jourani; Khalid Daoudi; Régine André-Obrecht; Driss Aboutajdine

Article Dans Une Revue Multimedia Tools and Applications Année : 2012

Large Margin GMM for discriminative speaker verifi cation

(1) , (2) , (1, 3) , (4)

1
2
3
4

Reda Jourani

Fonction : Auteur
PersonId : 881619
IdRef : 165708018

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Geometry and Statistics in acquisition data

Régine André-Obrecht

Fonction : Auteur
PersonId : 740810
IdHAL : obrecht
IdRef : 060375965

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Université Toulouse III - Paul Sabatier

Driss Aboutajdine

Fonction : Auteur

Laboratoire de Recherche en Informatique et Télécommunications [Rabat]

Résumé

Gaussian mixture models (GMM), trained using the generative cri- terion of maximum likelihood estimation, have been the most popular ap- proach in speaker recognition during the last decades. This approach is also widely used in many other classi cation tasks and applications. Generative learning in not however the optimal way to address classi cation problems. In this paper we rst present a new algorithm for discriminative learning of diagonal GMM under a large margin criterion. This algorithm has the ma- jor advantage of being highly e cient, which allow fast discriminative GMM training using large scale databases. We then evaluate its performances on a full NIST speaker veri cation task using NIST-SRE'2006 data. In particular, we use the popular Symmetrical Factor Analysis (SFA) for session variability compensation. The results show that our system outperforms the state-of-the- art approaches of GMM-SFA and the SVM-based one, GSL-NAP. Relative reductions of the Equal Error Rate of about 9.33% and 14.88% are respec- tively achieved over these systems.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

JMTA.pdf (267.43 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Khalid Daoudi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00647983

Soumis le : dimanche 4 décembre 2011-15:32:21

Dernière modification le : jeudi 1 février 2024-10:05:00

Archivage à long terme le : dimanche 4 décembre 2016-23:28:19

Dates et versions

hal-00647983 , version 1 (04-12-2011)

Identifiants

HAL Id : hal-00647983 , version 1

Citer

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Large Margin GMM for discriminative speaker verifi cation. Multimedia Tools and Applications, 2012. ⟨hal-00647983⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 UNIV-RENNES1 CNRS INRIA IRISA UT1-CAPITOLE INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES IRIT IRIT-SAMOVA UR1-MATH-NUM IRIT-SI TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

153 Consultations

238 Téléchargements

Large Margin GMM for discriminative speaker verifi cation

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager