Evaluation of contextual descriptors for HMM-based speech synthesis in French - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Evaluation of contextual descriptors for HMM-based speech synthesis in French

Nelly Barbot
Olivier Boëffard
  • Fonction : Auteur
  • PersonId : 883118

Résumé

In HTS, a HMM-based speech synthesis system, about fifty contextual factors are introduced to label a segment to synthesize English utterances. Published studies indicate that most of them are used for clustering the prosodic component of speech. Nevertheless, the influence of all these factors on modeling is still unclear for French. The work presented in this paper deals with the analysis of contextual factors on acoustic parameters modeling in the context of a French synthesis purpose. Two objective and one subjective methodologies of evaluation are carried out to conduct this study. The first one relies on a GMM-approach to achieve a global evaluation of the synthetic acoustic space. The second one is based on a pairwise distance determined according to the acoustic parameter evaluated. Finally, a subjective evaluation is conducted to complete this study. Experimental results show that using phonetic context improves the overall spectrum and duration modeling and using syllable informations improves the F0 modeling. However other contextual factors do not significantly improve the quality of the HTS models.

Domaines

Son [cs.SD]
Fichier non déposé

Dates et versions

hal-00987809 , version 1 (06-05-2014)

Identifiants

  • HAL Id : hal-00987809 , version 1

Citer

Sébastien Le Maguer, Nelly Barbot, Olivier Boëffard. Evaluation of contextual descriptors for HMM-based speech synthesis in French. ISCA Speech Synthesis Workshop (SSW8), Aug 2013, Barcelone, Spain. ⟨hal-00987809⟩
164 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More