DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR

Karan Nathwani 1 Emmanuel Vincent 2 Irina Illina 2
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : The uncertainty decoding framework is known to improve deep neural network (DNN) based automatic speech recognition (ASR) performance in noisy environments. It operates by estimating the statistical uncertainty about the input features and propagating it to the output senone posteriors by sampling. Unfortunately, this approximate propagation scheme limits the performance improvement. In this work, we exploit the fact that uncertainty propagation can be achieved in closed form for Gaussian mixture acoustic models (GMMs). We introduce new GMM-derived (GMMD) uncertainty features for robust DNN-based acoustic model training and decoding. The GMMD features are computed as the difference between the GMM log-likelihoods obtained with vs. without uncertainty. They are concatenated with conventional acoustic features and used as inputs to the DNN. We evaluate the resulting ASR performance on the CHiME-2 and CHiME-3 datasets. The proposed features are shown to improve performance on both datasets, both for conventional decoding and for uncertainty decoding with different uncertainty estimation/propagation techniques.
Document type :
Journal articles
Complete list of metadatas

Cited literature [33 references]  Display  Hide  Download

https://hal.inria.fr/hal-01680658
Contributor : Emmanuel Vincent <>
Submitted on : Thursday, January 11, 2018 - 1:47:35 AM
Last modification on : Wednesday, April 3, 2019 - 1:22:59 AM
Long-term archiving on: Thursday, May 3, 2018 - 4:38:40 PM

File

nathwani_SPL18.pdf
Files produced by the author(s)

Identifiers

Citation

Karan Nathwani, Emmanuel Vincent, Irina Illina. DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2018, ⟨10.1109/LSP.2018.2791534⟩. ⟨hal-01680658⟩

Share

Metrics

Record views

318

Files downloads

446