Using full-rank spatial covariance models for noise-robust ASR

Dung T. Tran; Emmanuel Vincent; Denis Jouvet; Kamil Adiloglu

Conference paper, Year: 2013



Dung T. Tran

Role: Author

Analysis, perception and recognition of speech

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Analysis, perception and recognition of speech

Denis Jouvet

Role: Author
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Analysis, perception and recognition of speech

Kamil Adiloglu

Role: Author

Hörtech gGmbH

Abstract

We present a joint spatial and spectral denoising front-end for Track 1 of the 2nd CHiME Speech Separation and Recognition Challenge, based on the Flexible Audio Source Separation Toolbox (FASST). We represent the sources by nonnegative matrix factorization (NMF) and full-rank spatial covariances, which are known to be appropriate for modeling small source movements. We then learn acoustic models for automatic speech recognition (ASR) on the enhanced training data. We obtain a 40% average error rate reduction due to speech separation compared to multicondition training alone.
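As an illustration of the kind of model the abstract refers to, the following is a minimal NumPy sketch of multichannel Wiener filtering under a local Gaussian model in which each source has an NMF-structured spectral variance and a full-rank spatial covariance. It is not taken from the paper or from the FASST code base: the function name, array layouts, and the random parameters are assumptions made for illustration, and the parameter estimation step (typically an EM-style algorithm) is not shown.

```python
import numpy as np

def multichannel_wiener_filter(x, v, R):
    """Estimate source spatial images from a multichannel mixture.

    x: (F, N, I) complex mixture STFT (F bins, N frames, I channels)
    v: (J, F, N) nonnegative source variances, e.g. v[j] = W[j] @ H[j] (NMF)
    R: (J, F, I, I) full-rank Hermitian spatial covariances
    Returns: (J, F, N, I) complex source image estimates.
    """
    # Covariance of each source image: v_j(f, n) * R_j(f), shape (J, F, N, I, I)
    sigma_src = v[..., None, None] * R[:, :, None, :, :]
    # Mixture covariance is the sum over sources, shape (F, N, I, I)
    sigma_mix = sigma_src.sum(axis=0)
    # Wiener gain per source: sigma_src_j @ inv(sigma_mix)
    gain = sigma_src @ np.linalg.inv(sigma_mix)
    # Apply each gain matrix to the mixture vector at every (f, n)
    return np.einsum("jfnab,fnb->jfna", gain, x)

# Toy parameters (hypothetical values, for illustration only)
rng = np.random.default_rng(0)
F, N, I, J, K = 4, 5, 2, 2, 3
W = rng.random((J, F, K))          # NMF spectral bases
H = rng.random((J, K, N))          # NMF temporal activations
v = W @ H                          # (J, F, N) spectral variances
A = rng.standard_normal((J, F, I, I)) + 1j * rng.standard_normal((J, F, I, I))
R = A @ A.conj().swapaxes(-1, -2) + 1e-3 * np.eye(I)  # Hermitian positive definite
x = rng.standard_normal((F, N, I)) + 1j * rng.standard_normal((F, N, I))

c_hat = multichannel_wiener_filter(x, v, R)
```

By construction the per-source gains sum to the identity, so the estimated source images add back up to the mixture; this is a convenient sanity check for any implementation of this filter.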

Keywords

speech separation; FASST; noise-robust speech recognition

Domains

Signal and Image Processing [eess.SP]

Main file

tran_CHiME13.pdf (42.02 KB)

Origin: Files produced by the author(s)

Contributor: Emmanuel Vincent

https://inria.hal.science/hal-00801162

Submitted on: Friday, March 15, 2013, 10:59:07

Last modified on: Thursday, February 1, 2024, 10:06:10

Long-term archiving on: Monday, June 17, 2013, 14:22:18

Dates and versions

hal-00801162, version 1 (15-03-2013)

Identifiers

HAL Id: hal-00801162, version 1

Cite

Dung T. Tran, Emmanuel Vincent, Denis Jouvet, Kamil Adiloglu. Using full-rank spatial covariance models for noise-robust ASR. CHiME - 2nd International Workshop on Machine Listening in Multisource Environments - 2013, Jun 2013, Vancouver, Canada. pp.31-32. ⟨hal-00801162⟩


Collections

UNIV-RENNES1 CNRS INRIA IRISA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SILECS UR1-MATH-NUM

259 views

330 downloads
