Large Margin Gaussian mixture models for speaker identification

Abstract : Gaussian mixture models (GMMs) have been widely and successfully used in speaker recognition during the last decade. However, they are generally trained using the generative criterion of maximum likelihood estimation. In this paper, we propose a simple and efficient discriminative approach that learns GMMs with a large margin criterion to solve the classification problem. Our approach builds on recent work on Large Margin GMMs (LM-GMM), in which each class is modeled by a mixture of ellipsoids and which has shown good results in speech recognition. We propose a simplification of the original algorithm and carry out preliminary experiments on a speaker identification task using NIST-SRE'2006 data. We compare the traditional generative GMM approach, the original LM-GMM algorithm, and our simplified version. The results suggest that our algorithm outperforms the other two.

https://hal.inria.fr/inria-00532781
Contributor : Khalid Daoudi
Submitted on : Thursday, November 4, 2010 - 1:53:11 PM
Last modification on : Friday, January 10, 2020 - 9:09:11 PM

Identifiers

  • HAL Id : inria-00532781, version 1

Citation

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Large Margin Gaussian mixture models for speaker identification. Interspeech, Sep 2010, Makuhari, Japan. ⟨inria-00532781⟩
