Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields

Irina Illina
Dominique Fohr
Denis Jouvet

Résumé

We propose an approach to grapheme-to-phoneme conversion with multiple pronunciations based on a probabilistic method: Conditional Random Fields (CRF). CRF give a long term prediction and assume relaxed state independence condition compared to HMMs. Moreover, we propose an algorithm to one-to-one letter to phoneme alignment needed for CRF training. This alignment is based on discrete HMM. This paper investigated the impact of the training set size and the multiple pronunciation generation. Validated on BDLex French dictionary, our approach compares favorably with the performance of the state-of-the-art Joint-Multigram Models in term of the quality of the pronunciations and in term of recall and precision measures for multiple pronunciation variants generation.
Fichier non déposé

Dates et versions

inria-00616325 , version 1 (22-08-2011)

Identifiants

  • HAL Id : inria-00616325 , version 1

Citer

Irina Illina, Dominique Fohr, Denis Jouvet. Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields. XIV International Conference "Speech and Computer" (SPECOM'2011), Sep 2011, Kazan, Russia. ⟨inria-00616325⟩
213 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More