Discriminative speaker recognition using Large Margin GMM

Reda Jourani; Khalid Daoudi; Régine André-Obrecht; Driss Aboutajdine

doi:10.1007/s00521-012-1079-y

Journal article, Neural Computing and Applications, Year: 2012

Reda Jourani

Role: Author
PersonId: 881619
IdRef: 165708018

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Khalid Daoudi

Role: Author
PersonId: 1329075
ORCID: 0000-0003-3536-1060
IdRef: 115483500

Geometry and Statistics in acquisition data

Régine André-Obrecht

Role: Author
PersonId: 740810
IdHAL: obrecht
IdRef: 060375965

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Université Toulouse III - Paul Sabatier

Driss Aboutajdine

Role: Author
PersonId: 906382

Laboratoire de Recherche en Informatique et Télécommunications [Rabat]

Abstract

Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. Generative Gaussian mixture models (GMM), on the other hand, have been widely used in speaker recognition over the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm whose major advantage is high computational efficiency, which makes it well suited to large-scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performance of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative system based on Support Vector Machines (SVM) with GMM supervectors in both speaker recognition tasks.
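
The abstract names a large margin criterion for GMM with diagonal covariances but does not spell it out on this record. As a rough illustration only, the following Python sketch shows one generic way such a criterion can be written: a hinge penalty that is positive whenever a competing speaker's GMM log-likelihood comes within a margin of the true speaker's. This is not the authors' algorithm. The function names, the `models` layout, and the unit margin are assumptions made for the example.

```python
import math


def diag_gaussian_log_density(x, mean, var):
    """Log density of a Gaussian with diagonal covariance, evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of x under a diagonal-covariance GMM (log-sum-exp)."""
    terms = [
        math.log(w) + diag_gaussian_log_density(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))


def large_margin_hinge_loss(x, true_speaker, models, margin=1.0):
    """Sum of hinge penalties over competing speakers (assumed formulation).

    `models` maps a speaker label to a (weights, means, variances) tuple.
    A competitor contributes only if its score is not at least `margin`
    below the true speaker's score.
    """
    true_score = gmm_log_likelihood(x, *models[true_speaker])
    loss = 0.0
    for speaker, params in models.items():
        if speaker == true_speaker:
            continue
        loss += max(0.0, margin + gmm_log_likelihood(x, *params) - true_score)
    return loss


# Hypothetical two-speaker example with one Gaussian component per speaker.
models = {
    "spk_a": ([1.0], [[0.0, 0.0]], [[1.0, 1.0]]),
    "spk_b": ([1.0], [[3.0, 3.0]], [[1.0, 1.0]]),
}
well_separated = large_margin_hinge_loss([0.0, 0.0], "spk_a", models)
misclassified = large_margin_hinge_loss([3.0, 3.0], "spk_a", models)
```

In this toy setting, a frame lying on the true speaker's mean incurs no penalty, while a frame lying on the competitor's mean incurs a penalty that grows with the score gap. A discriminative training procedure would adjust the GMM parameters to reduce the sum of such penalties over the training data.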

Domains

Signal and image processing [eess.SP]

Main file

PAPER.pdf (166.8 KB)

Origin: Files produced by the author(s)

Contributor: Khalid Daoudi

https://inria.hal.science/hal-00750385

Submitted on: Friday, 9 November 2012, 16:46:26

Last modified on: Friday, 2 February 2024, 03:34:21

Long-term archiving on: Sunday, 10 February 2013, 04:20:09

Dates and versions

hal-00750385 , version 1 (09-11-2012)

Identifiers

HAL Id: hal-00750385, version 1
DOI: 10.1007/s00521-012-1079-y

Cite

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Discriminative speaker recognition using Large Margin GMM. Neural Computing and Applications, 2012, ⟨10.1007/s00521-012-1079-y⟩. ⟨hal-00750385⟩

Collections

UNIV-TLSE2 UNIV-RENNES1 CNRS INRIA IRISA UT1-CAPITOLE INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES IRIT IRIT-SAMOVA UR1-MATH-NUM IRIT-SI TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

129 views

434 downloads
