Discriminative importance weighting of augmented training data for acoustic model training

Sunit Sivasankaran; Emmanuel Vincent; Irina Illina

Communication Dans Un Congrès Année : 2017

Discriminative importance weighting of augmented training data for acoustic model training

(1) , (1) , (1)

Sunit Sivasankaran

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Emmanuel Vincent

Fonction : Auteur
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Résumé

DNN based acoustic models require a large amount of training data. Parametric data augmentation techniques such as adding noise, reverberation, or changing the speech rate, are often employed to boost the dataset size and the ASR performance. The choice of augmentation techniques and the associated parameters has been handled heuristically so far. In this work we propose an algorithm to automatically weight data perturbed using a variety of augmentation techniques and/or parameters. The weights are learned in a discriminative fashion so as to minimize the frame error rate using the standard gradient descent algorithm in an iterative manner. Experiments were performed using the CHiME-3 dataset. Data augmentation was done by adding noise at different SNRs. A relative WER improvement of 15% was obtained with the proposed data weighting algorithm compared to the unweighted augmented dataset. Interestingly, the resulting distribution of SNRs in the weighted training set differs significantly from that of the test set.

Mots clés

ASR data augmentation feature simulation DNN CHiME

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

sivasankaran_ICASSP17.pdf (192.39 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuel Vincent : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01415759

Soumis le : lundi 6 mars 2017-13:35:30

Dernière modification le : jeudi 1 février 2024-10:03:33

Archivage à long terme le : mercredi 7 juin 2017-13:54:19

Dates et versions

hal-01415759 , version 1 (13-12-2016)

hal-01415759 , version 2 (06-03-2017)

Identifiants

HAL Id : hal-01415759 , version 2

Citer

Sunit Sivasankaran, Emmanuel Vincent, Irina Illina. Discriminative importance weighting of augmented training data for acoustic model training. 42th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States. ⟨hal-01415759v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SILECS UR1-MATH-NUM

611 Consultations

625 Téléchargements

Discriminative importance weighting of augmented training data for acoustic model training

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager