Skip to Main content Skip to Navigation
Journal articles

Discriminative speaker recognition using Large Margin GMM

Abstract : Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. On the other hand, generative Gaussian mixture models (GMM) have been widely used in speaker recognition during the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm which has the major advantage of being computationally highly efficient, thus well suited to handle large scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performances of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative Support Vector Machines (SVM) based system of SVM-GMM supervectors, in the two speaker recognition tasks.
Complete list of metadata

Cited literature [20 references]  Display  Hide  Download
Contributor : Khalid Daoudi Connect in order to contact the contributor
Submitted on : Friday, November 9, 2012 - 4:46:26 PM
Last modification on : Tuesday, June 14, 2022 - 12:13:53 PM
Long-term archiving on: : Sunday, February 10, 2013 - 4:20:09 AM


Files produced by the author(s)



Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Discriminative speaker recognition using Large Margin GMM. Neural Computing and Applications, Springer Verlag, 2012, ⟨10.1007/s00521-012-1079-y⟩. ⟨hal-00750385⟩



Record views


Files downloads