On using units trained on foreign data for improved multiple accent speech recognition

Katarina Bartkova; Denis Jouvet

doi:10.1016/j.specom.2006.12.009

Article Dans Une Revue Speech Communication Année : 2007

On using units trained on foreign data for improved multiple accent speech recognition

(1) , (1)

Katarina Bartkova

Fonction : Auteur

France Télécom R&D

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

France Télécom R&D

Résumé

Foreign accented speech recognition systems have to deal with the acoustic realization of sounds produced by non-native speakers that does not always match with native speech models. As the standard native speech modeling alone is generally not adequate, it is usually extended with models of phonemes estimated from speech data of foreign languages, and often complemented with extra pronunciation variants. In this paper, the focus is set on the speech recognition of multiple non-native accents. The speech corpus used was recorded from speakers originated from 24 different countries. The introduction of models of phonemes of the target language adapted on foreign speech data is presented and detailed. For the recognition of non-native speech comprising multiple foreign accents, this approach provides better performance than the introduction of standard foreign units. The selection of the most frequent acoustic variants for each phoneme is also discussed as this method makes recognition results more homogenous across speaker language groups. Furthermore, the adaptation of the acoustic models on non-native speech data is studied. Results show that detailed models, which include the modeling of extra pronunciation variants through acoustic units estimated on foreign data, benefit more from the task and accent adaptation process than baseline standard models used for native speech recognition. In addition, experiments show that an adaptation of the acoustic models on a limited set of foreign accents provides speech recognition performance improvements even on foreign accents absent from the adaptation data.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Denis Jouvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00616501

Soumis le : lundi 22 août 2011-17:30:19

Dernière modification le : mercredi 8 novembre 2017-18:46:02

Dates et versions

inria-00616501 , version 1 (22-08-2011)

Identifiants

HAL Id : inria-00616501 , version 1
DOI : 10.1016/j.specom.2006.12.009

Citer

Katarina Bartkova, Denis Jouvet. On using units trained on foreign data for improved multiple accent speech recognition. Speech Communication, 2007, 49 (10-11), pp.836-846. ⟨10.1016/j.specom.2006.12.009⟩. ⟨inria-00616501⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

32 Consultations

0 Téléchargements

On using units trained on foreign data for improved multiple accent speech recognition

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Altmetric

Partager