DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR - Archive ouverte HAL Access content directly
Journal Articles IEEE Signal Processing Letters Year : 2018

DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR

(1) , (2) , (2)
1
2

Abstract

The uncertainty decoding framework is known to improve deep neural network (DNN) based automatic speech recognition (ASR) performance in noisy environments. It operates by estimating the statistical uncertainty about the input features and propagating it to the output senone posteriors by sampling. Unfortunately, this approximate propagation scheme limits the performance improvement. In this work, we exploit the fact that uncertainty propagation can be achieved in closed form for Gaussian mixture acoustic models (GMMs). We introduce new GMM-derived (GMMD) uncertainty features for robust DNN-based acoustic model training and decoding. The GMMD features are computed as the difference between the GMM log-likelihoods obtained with vs. without uncertainty. They are concatenated with conventional acoustic features and used as inputs to the DNN. We evaluate the resulting ASR performance on the CHiME-2 and CHiME-3 datasets. The proposed features are shown to improve performance on both datasets, both for conventional decoding and for uncertainty decoding with different uncertainty estimation/propagation techniques.
Fichier principal
Vignette du fichier
nathwani_SPL18.pdf (788.17 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01680658 , version 1 (11-01-2018)

Identifiers

Cite

Karan Nathwani, Emmanuel Vincent, Irina Illina. DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR. IEEE Signal Processing Letters, 2018, ⟨10.1109/LSP.2018.2791534⟩. ⟨hal-01680658⟩
229 View
389 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More