8485 articles  [version française]

hal-00660689, version 1

Uncertainty-based learning of Gaussian mixture models from noisy data

Alexey Ozerov 1, Mathieu Lagrange 2, Emmanuel Vincent () 3

N° RR-7862 (2012)

Abstract: We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where the uncertainty over the data is given by a Gaussian distribution. While this uncertainty is commonly exploited at the decoding stage via uncertainty decoding, it has not been exploited at the training stage so far. We introduce a new Expectation-Maximization (EM) algorithm called uncertainty training that allows to learn GMMs directly from noisy data while taking their uncertainty into account. We evaluate its potential for a speaker recognition task over speech data corrupted by real-world domestic background noise, using a state-of-the-art signal enhancement technique and various uncertainty estimation techniques as a front-end. Compared to conventional training, the proposed algorithm results in 3\% to 4\% absolute improvement in speaker recognition accuracy by training from either matched, unmatched or multi-condition noisy data. This algorithm is also applicable with minor modifications to maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) model adaptation and to the training of hidden Markov models (HMMs) from noisy data.

  • 1:  Technicolor R & I
  • Technicolor
  • 2:  Institut de Recherche et Coordination Acoustique/Musique (IRCAM)
  • IRCAM
  • 3:  METISS (INRIA - IRISA)
  • CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
  • Domain : Computer Science/Signal and Image Processing
    Engineering Sciences/Signal and Image processing
  • Internal note : RR-7862
  • Available versions :  v1 (2012-01-17) v2 (2012-07-16)
 
  • hal-00660689, version 1
  • oai:hal.inria.fr:hal-00660689
  • From: 
  • Submitted on: Tuesday, 17 January 2012 13:59:28
  • Updated on: Tuesday, 17 January 2012 14:09:34