Investigating Stranded GMM for Improving Automatic Speech Recognition

Arseniy Gorin 1 Denis Jouvet 1 Emmanuel Vincent 1 Dung Tran 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper investigates recently proposed Stranded Gaussian Mixture acoustic Model (SGMM) for Automatic Speech Recognition (ASR). This model extends conventional hidden Markov model (HMM-GMM) by explicitly introducing dependencies between components of the observation Gaussian mixture densities. The main objective of the paper is to experimentally study, how useful SGMM can be for dealing with data, which contains different sources of acoustic variability. First studied sources of variability are age and gender in quiet environment (TIdigits task including child speech). Second, the SGMM modeling is applied on data produced by different speakers and corrupted by non-stationary noise (CHiME 2013 challenge data). Finally, SGMM is applied on the same noisy data, but after performing speech enhancement (i.e., the remaining variability mostly comes from residual noise and different speakers). Although SGMM was originally proposed for robust speech recognition of noisy data, in this work it was found, that the model is more efficient for handling speaker variability in quiet environment.
Type de document :
Communication dans un congrès
4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), May 2014, Nancy, France. 2014
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01003054
Contributeur : Arseniy Gorin <>
Soumis le : mardi 10 juin 2014 - 11:14:08
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24
Document(s) archivé(s) le : mercredi 10 septembre 2014 - 11:30:21

Fichier

ago_HSCMA14_v5.2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01003054, version 2

Citation

Arseniy Gorin, Denis Jouvet, Emmanuel Vincent, Dung Tran. Investigating Stranded GMM for Improving Automatic Speech Recognition. 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), May 2014, Nancy, France. 2014. 〈hal-01003054v2〉

Partager

Métriques

Consultations de la notice

480

Téléchargements de fichiers

231