Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields

Irina Illina; Dominique Fohr; Denis Jouvet

Communication Dans Un Congrès Année : 2011

Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields

(1) , (1) , (1)

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Analysis, perception and recognition of speech

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Analysis, perception and recognition of speech

Résumé

We propose an approach to grapheme-to-phoneme conversion with multiple pronunciations based on a probabilistic method: Conditional Random Fields (CRF). CRF give a long term prediction and assume relaxed state independence condition compared to HMMs. Moreover, we propose an algorithm to one-to-one letter to phoneme alignment needed for CRF training. This alignment is based on discrete HMM. This paper investigated the impact of the training set size and the multiple pronunciation generation. Validated on BDLex French dictionary, our approach compares favorably with the performance of the state-of-the-art Joint-Multigram Models in term of the quality of the pronunciations and in term of recall and precision measures for multiple pronunciation variants generation.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Denis Jouvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00616325

Soumis le : lundi 22 août 2011-10:34:21

Dernière modification le : vendredi 24 mars 2023-14:52:54

Dates et versions

inria-00616325 , version 1 (22-08-2011)

Identifiants

HAL Id : inria-00616325 , version 1

Citer

Irina Illina, Dominique Fohr, Denis Jouvet. Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields. XIV International Conference "Speech and Computer" (SPECOM'2011), Sep 2011, Kazan, Russia. ⟨inria-00616325⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

213 Consultations

0 Téléchargements

Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager