Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields - Archive ouverte HAL Access content directly
Conference Papers Year : 2011

Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields

(1) , (1) , (1)
1
Irina Illina
Dominique Fohr
Denis Jouvet

Abstract

We propose an approach to grapheme-to-phoneme conversion with multiple pronunciations based on a probabilistic method: Conditional Random Fields (CRF). CRF give a long term prediction and assume relaxed state independence condition compared to HMMs. Moreover, we propose an algorithm to one-to-one letter to phoneme alignment needed for CRF training. This alignment is based on discrete HMM. This paper investigated the impact of the training set size and the multiple pronunciation generation. Validated on BDLex French dictionary, our approach compares favorably with the performance of the state-of-the-art Joint-Multigram Models in term of the quality of the pronunciations and in term of recall and precision measures for multiple pronunciation variants generation.
Not file

Dates and versions

inria-00616325 , version 1 (22-08-2011)

Identifiers

  • HAL Id : inria-00616325 , version 1

Cite

Irina Illina, Dominique Fohr, Denis Jouvet. Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields. XIV International Conference "Speech and Computer" (SPECOM'2011), Sep 2011, Kazan, Russia. ⟨inria-00616325⟩
198 View
0 Download

Share

Gmail Facebook Twitter LinkedIn More