Fast training of Large Margin diagonal Gaussian mixture models for speaker identification - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Fast training of Large Margin diagonal Gaussian mixture models for speaker identification

Résumé

Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decades. They are generally trained using the generative criterion of maximum likelihood estimation. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we present a new version of this algorithm which has the major advantage of being computationally highly efficient. The resulting algorithm is thus well suited to handle large scale databases. We carry out experiments on a speaker identification task using NIST-SRE'2006 data and compare our new algorithm to the baseline generative GMM using different GMM sizes. The results show that our system significantly outperforms the baseline GMM in all configurations, and with high computational efficiency.
Fichier principal
Vignette du fichier
SpeD-2011.pdf (498.33 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00647213 , version 1 (01-12-2011)

Identifiants

  • HAL Id : hal-00647213 , version 1

Citer

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Fast training of Large Margin diagonal Gaussian mixture models for speaker identification. International Conference on Speech Technology and Human-Computer Dialogue (SpeD), May 2011, Brasov, Romania. ⟨hal-00647213⟩
248 Consultations
427 Téléchargements

Partager

Gmail Facebook X LinkedIn More