Discriminative speaker recognition using Large Margin GMM

Abstract : Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. On the other hand, generative Gaussian mixture models (GMM) have been widely used in speaker recognition during the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm which has the major advantage of being computationally highly efficient, thus well suited to handle large scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performances of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative Support Vector Machines (SVM) based system of SVM-GMM supervectors, in the two speaker recognition tasks.
Type de document :
Article dans une revue
Neural Computing and Applications, Springer Verlag, 2012, 〈10.1007/s00521-012-1079-y〉
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

Contributeur : Khalid Daoudi <>
Soumis le : vendredi 9 novembre 2012 - 16:46:26
Dernière modification le : jeudi 11 janvier 2018 - 06:21:34
Document(s) archivé(s) le : dimanche 10 février 2013 - 04:20:09


Fichiers produits par l'(les) auteur(s)




Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Discriminative speaker recognition using Large Margin GMM. Neural Computing and Applications, Springer Verlag, 2012, 〈10.1007/s00521-012-1079-y〉. 〈hal-00750385〉



Consultations de la notice


Téléchargements de fichiers