Combination of SVM and Large Margin GMM modeling for speaker identification

Abstract : Most state-of-the-art speaker recognition systems are partially or completely based on Gaussian mixture models (GMM). GMM have been widely and successfully used in speaker recognition during the last decades. They are traditionally estimated from a world model using the generative criterion of Maximum A Posteriori. In an earlier work, we proposed an efficient algorithm for discriminative learning of GMM with diagonal covariances under a large margin criterion. In this paper, we evaluate the combination of the large margin GMM modeling approach with SVM in the setting of speaker identification. We carry out a full NIST speaker identification task using NIST-SRE'2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that the two modeling approaches are complementary and that their combination outperforms their single use.
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.inria.fr/hal-00908372
Contributor : Khalid Daoudi <>
Submitted on : Friday, November 22, 2013 - 5:16:40 PM
Last modification on : Monday, January 13, 2020 - 10:04:03 AM
Long-term archiving on: Sunday, February 23, 2014 - 4:31:10 AM

File

eusipco13paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00908372, version 1

Citation

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Combination of SVM and Large Margin GMM modeling for speaker identification. Eusipco, Sep 2013, Marrakech, Morocco. ⟨hal-00908372⟩

Share

Metrics

Record views

748

Files downloads

1126