Evaluating grapheme-to-phoneme converters in automatic speech recognition context

Denis Jouvet 1 Dominique Fohr 1 Irina Illina 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with the evaluation of grapheme-to-phoneme (G2P) converters in a speech recognition context. The precision and recall rates are investigated as potential measures of the quality of the multiple generated pronunciation variants. Very different results are obtained whether or not we take into account the frequency of occurrence of the words. Since G2P systems are rarely evaluated on a speech recognition performance basis, the originality of this paper consists in using a speech recognition system to evaluate the G2P pronunciation variants. The results show that the training process is quite robust to some errors in the pronunciation lexicon, whereas pronunciation lexicon errors are harmful in the decoding process. Noticeable speech recognition performance improvements are achieved by combining two different G2P converters, one based on conditional random fields and the other on joint multigram models, as well as by checking the pronunciation variants of the most frequent words.
Type de document :
Communication dans un congrès
ICASSP - 2012 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2012, Kyoto, Japan. pp.4821 - 4824, 2012, 〈10.1109/ICASSP.2012.6288998〉
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00753364
Contributeur : Denis Jouvet <>
Soumis le : lundi 14 septembre 2015 - 11:41:22
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24
Document(s) archivé(s) le : mardi 29 décembre 2015 - 01:31:43

Fichier

EvalG2PinASRcontext-V1.1.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Denis Jouvet, Dominique Fohr, Irina Illina. Evaluating grapheme-to-phoneme converters in automatic speech recognition context. ICASSP - 2012 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2012, Kyoto, Japan. pp.4821 - 4824, 2012, 〈10.1109/ICASSP.2012.6288998〉. 〈hal-00753364〉

Partager

Métriques

Consultations de la notice

379

Téléchargements de fichiers

466