GMM-based classification from noisy features

Alexey Ozerov 1 Mathieu Lagrange 2 Emmanuel Vincent 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We consider Gaussian mixture model (GMM)-based classification from noisy features, where the uncertainty over each feature is represented by a Gaussian distribution. For that purpose, we first propose a new GMM training and decoding criterion called log-likelihood integration which, as opposed to the conventional likelihood integration criterion, does not rely on any assumption regarding the distribution of the data. Secondly, we introduce two new Expectation Maximization (EM) algorithms for the two criteria, that allow to learn GMMs directly from noisy features. We then evaluate and compare the behaviors of two proposed algorithms with a categorization task on artificial data and speech data with additive artificial noise, assuming the uncertainty parameters are known. Experiments demonstrate the superiority of the likelihood integration criterion with the newly proposed EM learning in all tested configurations, thus giving rise to a new family of learning approaches that are insensitive to the heterogeneity of the noise characteristics between testing and training data.
Type de document :
Communication dans un congrès
International Workshop on Machine Listening in Multisource Environments (CHiME 2011), Sep 2011, Florence, Italy. 2011
Liste complète des métadonnées

Littérature citée [15 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00598742
Contributeur : Alexey Ozerov <>
Soumis le : mardi 7 juin 2011 - 14:36:20
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : vendredi 9 septembre 2011 - 15:17:23

Fichier

lagrangeChime11_v9.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00598742, version 1

Citation

Alexey Ozerov, Mathieu Lagrange, Emmanuel Vincent. GMM-based classification from noisy features. International Workshop on Machine Listening in Multisource Environments (CHiME 2011), Sep 2011, Florence, Italy. 2011. 〈inria-00598742〉

Partager

Métriques

Consultations de la notice

376

Téléchargements de fichiers

186