Fusion of Multiple Uncertainty Estimators and Propagators for Noise Robust ASR - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Fusion of Multiple Uncertainty Estimators and Propagators for Noise Robust ASR

Résumé

Uncertainty decoding has been successfully used for speech recognition in highly nonstationary noise environments. Yet, accurate estimation of the uncertainty on the denoised signals and propagation to the features remain difficult. In this work, we propose to fuse the uncertainty estimates obtained from different uncertainty estimators and propagators by linear combination. The fusion coefficients are optimized by minimizing a measure of divergence with oracle estimates on development data. Using the Kullback-Leibler divergence, we obtain 18\% relative error rate reduction on the 2nd CHiME Challenge with respect to conventional decoding, that is about twice as much as the reduction achieved by the best single uncertainty estimator and propagator.

Domaines

Son [cs.SD]
Fichier principal
Vignette du fichier
Icassp2014_fusion.pdf (91.94 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00955185 , version 1 (04-03-2014)
hal-00955185 , version 2 (11-03-2014)

Identifiants

  • HAL Id : hal-00955185 , version 2

Citer

Dung Tien Tran, Denis Jouvet, Emmanuel Vincent. Fusion of Multiple Uncertainty Estimators and Propagators for Noise Robust ASR. 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy. ⟨hal-00955185v2⟩
481 Consultations
351 Téléchargements

Partager

Gmail Facebook X LinkedIn More