Using full-rank spatial covariance models for noise-robust ASR - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Using full-rank spatial covariance models for noise-robust ASR

Résumé

We present a joint spatial and spectral denoising front-end for Track 1 of the 2nd CHiME Speech Separation and Recognition Challenge based on the Flexible Audio Source Separation Toolbox (FASST). We represent the sources by nonnegative matrix factorization (NMF) and full-rank spatial covariances, which are known to be appropriate for the modeling of small source movements. We then learn acoustic models for automatic speech recognition (ASR) on the enhanced training data. We obtain 40% average error rate reduction due to speech separation compared to multicondition training alone.
Fichier principal
Vignette du fichier
tran_CHiME13.pdf (42.02 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00801162 , version 1 (15-03-2013)

Identifiants

  • HAL Id : hal-00801162 , version 1

Citer

Dung T. Tran, Emmanuel Vincent, Denis Jouvet, Kamil Adiloglu. Using full-rank spatial covariance models for noise-robust ASR. CHiME - 2nd International Workshop on Machine Listening in Multisource Environments - 2013, Jun 2013, Vancouver, Canada. pp.31-32. ⟨hal-00801162⟩
259 Consultations
328 Téléchargements

Partager

Gmail Facebook X LinkedIn More