Large Margin Gaussian mixture models for speaker identification

Reda Jourani; Khalid Daoudi; Régine André-Obrecht; Driss Aboutajdine

Communication Dans Un Congrès Année : 2010

Large Margin Gaussian mixture models for speaker identification

(1) , (1) , (2, 3) , (4)

1
2
3
4

Reda Jourani

Fonction : Auteur
PersonId : 881619
IdRef : 165708018

Geometry and Statistics in acquisition data

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Geometry and Statistics in acquisition data

Régine André-Obrecht

Fonction : Auteur
PersonId : 740810
IdHAL : obrecht
IdRef : 060375965

Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio

Université Toulouse III - Paul Sabatier

Driss Aboutajdine

Fonction : Auteur

Laboratoire de Recherche en Informatique et Télécommunications [Rabat]

Résumé

Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decade. However, they are generally trained using the generative criterion of maximum likelihood estimation. In this paper, we propose a simple and efficient discriminative approach to learn GMM with a large margin criterion to solve the classification problem. Our approach is based on a recent work about the Large Margin GMM (LM-GMM) where each class is modeled by a mixture of ellipsoids and which has shown good results in speech recognition. We propose a simplification of the original algorithm and carry out preliminary experiments on a speaker identification task using NIST-SRE'2006 data. We compare the traditional generative GMM approach, the original LM-GMM one and our own version. The results suggest that our algorithm outperforms the two others.

Mots clés

large margin learning GMM-UBM speaker recognition discriminative learning

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Khalid Daoudi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00532781

Soumis le : jeudi 4 novembre 2010-13:53:11

Dernière modification le : vendredi 2 février 2024-03:34:21

Dates et versions

inria-00532781 , version 1 (04-11-2010)

Identifiants

HAL Id : inria-00532781 , version 1

Citer

Reda Jourani, Khalid Daoudi, Régine André-Obrecht, Driss Aboutajdine. Large Margin Gaussian mixture models for speaker identification. Interspeech, Sep 2010, Makuhari, Japan. ⟨inria-00532781⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 UNIV-RENNES1 CNRS INRIA IRISA UT1-CAPITOLE INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES IRIT IRIT-SAMOVA UR1-MATH-NUM IRIT-SI TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

149 Consultations

0 Téléchargements

Large Margin Gaussian mixture models for speaker identification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager