Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR

Dung Tien Tran 1 Emmanuel Vincent 2 Denis Jouvet 2
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Uncertainty propagation has been successfully employed for speech recognition in nonstationary noise environments. The uncertainty about the features is typically represented as a diagonal covariance matrix for static features only. We present a framework for estimating the uncertainty over both static and dynamic features as a full covariance matrix. The estimated covariance matrix is then multiplied by scaling coefficients optimized on development data. We achieve 21\% relative error rate reduction on the 2nd CHiME Challenge with respect to conventional decoding without uncertainty, that is five times more than the reduction achieved with diagonal uncertainty covariance for static features only.
Document type :
Conference papers
Complete list of metadatas

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-00954654
Contributor : Dung Tran <>
Submitted on : Tuesday, March 11, 2014 - 3:55:32 PM
Last modification on : Saturday, March 30, 2019 - 1:26:27 AM
Long-term archiving on : Wednesday, June 11, 2014 - 12:51:23 PM

File

Icassp2014_extension.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00954654, version 2

Citation

Dung Tien Tran, Emmanuel Vincent, Denis Jouvet. Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR. 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy. ⟨hal-00954654v2⟩

Share

Metrics

Record views

758

Files downloads

459