Nonparametric uncertainty estimation and propagation for noise robust ASR - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue IEEE/ACM Transactions on Audio, Speech and Language Processing Année : 2015

Nonparametric uncertainty estimation and propagation for noise robust ASR

Résumé

We consider the framework of uncertainty propagation for automatic speech recognition (ASR) in highly non-stationary noise environments. Uncertainty is considered as the variance of speech distortion. Yet, its accurate estimation in the spectral domain and its propagation to the feature domain remain difficult. Existing methods typically rely on a single uncertainty estimator and propagator fixed by mathematical approximation. In this paper, we propose a new paradigm where we seek to learn more powerful mappings to predict uncertainty from data. We investigate two such possible mappings: linear fusion of multiple uncertainty estimators/propagators and nonparametric uncertainty estimation/propagation. In addition, a procedure to propagate the estimated spectral-domain uncertainty to the static Mel frequency cepstral coefficients (MFCCs), to the log-energy, and to their first- and second-order time derivatives is proposed. This results in a full uncertainty covariance matrix over both static and dynamic MFCCs. Experimental evaluation on Tracks 1 and 2 of the 2nd CHiME Challenge resulted in up to 29% and 28% relative keyword error rate reduction with respect to speech enhancement alone.
Fichier principal
Vignette du fichier
FinalVersion.pdf (409.93 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01114329 , version 1 (09-02-2015)
hal-01114329 , version 2 (17-07-2015)

Identifiants

Citer

Dung T. Tran, Emmanuel Vincent, Denis Jouvet. Nonparametric uncertainty estimation and propagation for noise robust ASR. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (11), pp.1835-1846. ⟨10.1109/TASLP.2015.2450497⟩. ⟨hal-01114329v2⟩
403 Consultations
546 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More