hal-00660689, version 1
Uncertainty-based learning of Gaussian mixture models from noisy data
N° RR-7862 (2012)
Abstract: We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where the uncertainty over the data is given by a Gaussian distribution. While this uncertainty is commonly exploited at the decoding stage via uncertainty decoding, it has not been exploited at the training stage so far. We introduce a new Expectation-Maximization (EM) algorithm called uncertainty training that allows to learn GMMs directly from noisy data while taking their uncertainty into account. We evaluate its potential for a speaker recognition task over speech data corrupted by real-world domestic background noise, using a state-of-the-art signal enhancement technique and various uncertainty estimation techniques as a front-end. Compared to conventional training, the proposed algorithm results in 3\% to 4\% absolute improvement in speaker recognition accuracy by training from either matched, unmatched or multi-condition noisy data. This algorithm is also applicable with minor modifications to maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) model adaptation and to the training of hidden Markov models (HMMs) from noisy data.
- 1:
- Technicolor
- 2:
- IRCAM
- 3:
- CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
- Domain : Computer Science/Signal and Image Processing
Engineering Sciences/Signal and Image processing - Internal note : RR-7862
- Available versions : v1 (2012-01-17) v2 (2012-07-16)
- hal-00660689, version 1
- http://hal.inria.fr/hal-00660689
- oai:hal.inria.fr:hal-00660689
- From:
- Submitted on: Tuesday, 17 January 2012 13:59:28
- Updated on: Tuesday, 17 January 2012 14:09:34





Associated documents
Export