DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR - Inria - Institut national de recherche en sciences et technologies du numérique
Journal article, IEEE Signal Processing Letters, Year: 2018

DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR

Abstract

The uncertainty decoding framework is known to improve deep neural network (DNN) based automatic speech recognition (ASR) performance in noisy environments. It operates by estimating the statistical uncertainty about the input features and propagating it to the output senone posteriors by sampling. Unfortunately, this approximate propagation scheme limits the performance improvement. In this work, we exploit the fact that uncertainty propagation can be achieved in closed form for Gaussian mixture acoustic models (GMMs). We introduce new GMM-derived (GMMD) uncertainty features for robust DNN-based acoustic model training and decoding. The GMMD features are computed as the difference between the GMM log-likelihoods obtained with vs. without uncertainty. They are concatenated with conventional acoustic features and used as inputs to the DNN. We evaluate the resulting ASR performance on the CHiME-2 and CHiME-3 datasets. The proposed features are shown to improve performance on both datasets, both for conventional decoding and for uncertainty decoding with different uncertainty estimation/propagation techniques.
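The GMMD feature computation described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes diagonal-covariance GMMs, a feature uncertainty given as a per-dimension variance, and the standard closed-form result that marginalising a Gaussian feature posterior adds the uncertainty variance to each component variance. All function and parameter names (`gmm_log_likelihood`, `gmmd_uncertainty_features`, `dnn_input`, `gmms`) are hypothetical.

```python
import numpy as np

def gmm_log_likelihood(x, weights, means, variances, uncertainty=None):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM.

    Assumption: if `uncertainty` (per-dimension variance of the feature
    estimate) is given, it is added to every component variance, which is
    the closed-form result of marginalising a Gaussian feature posterior.
    """
    var = variances if uncertainty is None else variances + uncertainty
    # Per-component Gaussian log-density, summed over feature dimensions.
    log_gauss = -0.5 * np.sum(
        np.log(2.0 * np.pi * var) + (x - means) ** 2 / var, axis=1
    )
    # Numerically stable log-sum-exp over the weighted components.
    a = np.log(weights) + log_gauss
    m = np.max(a)
    return m + np.log(np.sum(np.exp(a - m)))

def gmmd_uncertainty_features(x, uncertainty, gmms):
    """One feature per GMM: log-likelihood with uncertainty minus without."""
    return np.array([
        gmm_log_likelihood(x, w, mu, var, uncertainty)
        - gmm_log_likelihood(x, w, mu, var)
        for (w, mu, var) in gmms
    ])

def dnn_input(acoustic_features, x, uncertainty, gmms):
    """Concatenate conventional features with the GMMD uncertainty features."""
    return np.concatenate(
        [acoustic_features, gmmd_uncertainty_features(x, uncertainty, gmms)]
    )
```

In this sketch, `gmms` is a list of `(weights, means, variances)` tuples with shapes `(K,)`, `(K, D)` and `(K, D)`. With zero uncertainty every GMMD feature is exactly zero, so the features only carry information for frames where the feature estimate is uncertain.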
Main file: nathwani_SPL18.pdf (788.17 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01680658, version 1 (11-01-2018)

Cite

Karan Nathwani, Emmanuel Vincent, Irina Illina. DNN Uncertainty Propagation using GMM-Derived Uncertainty Features for Noise Robust ASR. IEEE Signal Processing Letters, 2018, ⟨10.1109/LSP.2018.2791534⟩. ⟨hal-01680658⟩