Uncertainty propagation through deep neural networks

Ahmed Hussen Abdelaziz; Shinji Watanabe; John R. Hershey; Emmanuel Vincent; Dorothea Kolossa

Conference paper, Year: 2015

Ahmed Hussen Abdelaziz

Role: Author

Ruhr University Bochum = Ruhr-Universität Bochum

Shinji Watanabe

Role: Author

Mitsubishi Electric Research Laboratories

John R. Hershey

Role: Author

Mitsubishi Electric Research Laboratories

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Dorothea Kolossa

Role: Author

Ruhr University Bochum = Ruhr-Universität Bochum

Abstract

To improve automatic speech recognition (ASR) performance in noisy environments, distorted speech is typically pre-processed by a speech enhancement algorithm, which usually yields a speech estimate containing residual noise and distortion. Measures of the uncertainty or variance of this estimate may also be available. Uncertainty decoding is a framework that uses this knowledge of uncertainty in the input features during acoustic model scoring. Such frameworks have been well explored for traditional probabilistic models, but their optimal use for deep neural network (DNN)-based ASR systems is not yet clear. In this paper, we study the propagation of observation uncertainties through the layers of a DNN-based acoustic model. Since exact propagation is intractable due to the nonlinearities of the DNN, we employ approximate propagation methods, including Monte Carlo sampling, the unscented transform, and the piecewise exponential approximation of the activation function, to estimate the distribution of acoustic scores. Finally, the expected value of the acoustic score distribution is used for decoding, which is shown to further improve ASR accuracy on the CHiME database relative to a highly optimized DNN baseline.
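As a rough illustration of two of the approximate propagation methods named in the abstract, the sketch below estimates the expected output of a single sigmoid layer under a Gaussian input with diagonal covariance, once by Monte Carlo sampling and once with an unscented transform. It is not the authors' implementation: the layer weights `W`, the bias `b`, the scaling parameter `kappa`, and all function names are hypothetical, and only the mean of one layer's output is propagated, not the full score distribution of a multi-layer DNN.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def layer(x, W, b):
    """Affine transform followed by an element-wise sigmoid (hypothetical layer)."""
    return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]


def monte_carlo_mean(mu, var, W, b, n_samples=20000, seed=0):
    """Estimate E[layer(x)] for x ~ N(mu, diag(var)) by averaging over samples."""
    rng = random.Random(seed)
    acc = [0.0] * len(b)
    for _ in range(n_samples):
        # Draw one noisy feature vector from the assumed Gaussian posterior.
        x = [rng.gauss(m, math.sqrt(v)) for m, v in zip(mu, var)]
        y = layer(x, W, b)
        acc = [a + yi for a, yi in zip(acc, y)]
    return [a / n_samples for a in acc]


def unscented_mean(mu, var, W, b, kappa=1.0):
    """Estimate E[layer(x)] from 2n + 1 deterministic sigma points.

    Assumes a diagonal covariance, so the sigma points lie along the
    coordinate axes; kappa is an illustrative scaling choice.
    """
    n = len(mu)
    scale = n + kappa
    # Central sigma point with weight kappa / (n + kappa).
    points = [(list(mu), kappa / scale)]
    for i in range(n):
        step = math.sqrt(scale * var[i])
        for sign in (1.0, -1.0):
            x = list(mu)
            x[i] += sign * step
            # Each off-centre point carries weight 1 / (2 (n + kappa)).
            points.append((x, 0.5 / scale))
    out = [0.0] * len(b)
    for x, w in points:
        y = layer(x, W, b)
        out = [o + w * yi for o, yi in zip(out, y)]
    return out


if __name__ == "__main__":
    # Illustrative values only; not taken from the paper.
    mu, var = [0.5, -1.0], [0.4, 0.1]
    W, b = [[1.0, -0.5], [0.3, 0.8]], [0.1, -0.2]
    print("point estimate:", layer(mu, W, b))
    print("Monte Carlo   :", monte_carlo_mean(mu, var, W, b))
    print("unscented     :", unscented_mean(mu, var, W, b))
```

With zero input variance, the unscented estimate collapses to the ordinary forward pass `layer(mu, W, b)`. With moderate variance, the two estimates should agree closely, while the unscented transform needs only 2n + 1 layer evaluations instead of thousands of samples.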

Keywords

Noise-robust ASR; Deep Neural Networks; Observation Uncertainty; Uncertainty Propagation

Domains

Signal and image processing [eess.SP]

Main file

abdelaziz_IS15.pdf (215.8 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01162550

Submitted on: Wednesday, June 10, 2015, 18:06:48

Last modified on: Thursday, February 1, 2024, 10:04:25

Long-term archiving on: Tuesday, April 25, 2017, 06:31:56

Dates and versions

hal-01162550, version 1 (10-06-2015)

Identifiers

HAL Id: hal-01162550, version 1

Cite

Ahmed Hussen Abdelaziz, Shinji Watanabe, John R. Hershey, Emmanuel Vincent, Dorothea Kolossa. Uncertainty propagation through deep neural networks. Interspeech 2015, Sep 2015, Dresden, Germany. ⟨hal-01162550⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SILECS UR1-MATH-NUM

936 views

2703 downloads
