A comparison of different methods for noise adaptation in a HMM-based speech recognition system

Christophe Cerisara 1 Dominique Fohr 1 Irina Illina 1 Fabrice Lauri 1 Odile Mella 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Hidden Markov models (HMMs) have been successfully applied in speech recognition, but their performances dramatically drop in noisy conditions. This paper presents a comparison of different methods to increase the robustness of an HMM automatic speech recognition system. We have evaluated two types of approaches: the first one estimates a transformation from a few noisy sentences to adapt the initial models trained in clean speech. The second one tries to remove the noise from the signal without modifying the HMM models. We have compared the following methods: Parallel Model Combination(PMC) Maximum A Posteriori(MAP) Maximum Likelihood Linear Regression(MLLR) Multivariate-Gaussian-based Cepstral Normalization(RATZ) Vector Taylor Series(VTS) Spectral subtraction. Tests have been conducted on the noisy database of a voice command task: a multi-speaker navigation system using a limited vocabulary. On a subset of the database with a SNR of 10 dB, we have obtained the following results: baseline system:85% PMC:93% RATZ:91% MLLR:93% In the full paper, we will give the results for all the methods and all the noisy conditions and we will discuss the advantages and drawbacks of each method regarding the real time capabilities and the size of the adaptation set.
Type de document :
Communication dans un congrès
International Congress on Acoustics, 2001, Italy, Rome, 2 p, 2001
Liste complète des métadonnées

https://hal.inria.fr/inria-00101105
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 14:56:31
Dernière modification le : vendredi 9 février 2018 - 13:20:05

Identifiants

  • HAL Id : inria-00101105, version 1

Collections

Citation

Christophe Cerisara, Dominique Fohr, Irina Illina, Fabrice Lauri, Odile Mella. A comparison of different methods for noise adaptation in a HMM-based speech recognition system. International Congress on Acoustics, 2001, Italy, Rome, 2 p, 2001. 〈inria-00101105〉

Partager

Métriques

Consultations de la notice

215