Skip to Main content Skip to Navigation
Conference papers

Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features

Raheel Qader 1 Gwénolé Lecorvé 1 Damien Lolive 1 Pascale Sébillot 2
1 EXPRESSION - Expressiveness in Human Centered Data/Media
UBS - Université de Bretagne Sud, IRISA-D6 - MEDIA ET INTERACTIONS
2 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique , IRISA-D6 - MEDIA ET INTERACTIONS
Abstract : Pronunciation adaptation consists in predicting pronunciation variants of words and utterances based on their standard pronunciation and a target style. This is a key issue in text-to-speech as those variants bring expressiveness to synthetic speech, especially when considering a spontaneous style. This paper presents a new pronunciation adaptation method which adapts standard pronunciations to the style of individual speakers in a context of spontaneous speech. Its originality and strength are to solely rely on linguistic features and to consider a probabilistic machine learning framework, namely conditional random fields, to produce the adapted pronunciations. Features are first selected in a series of experiments, then combined to produce the final adaptation method. Backend experiments on the Buckeye conversational English speech corpus show that adapted pronunciations significantly better reflect spontaneous speech than standard ones, and that even better could be achieved if considering alternative predictions.
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download
Contributor : Gwénolé Lecorvé Connect in order to contact the contributor
Submitted on : Friday, October 16, 2015 - 10:23:07 AM
Last modification on : Thursday, January 20, 2022 - 5:33:10 PM
Long-term archiving on: : Thursday, April 27, 2017 - 12:17:36 AM


Files produced by the author(s)


  • HAL Id : hal-01181192, version 1


Raheel Qader, Gwénolé Lecorvé, Damien Lolive, Pascale Sébillot. Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features. International Conference on Statistical Language and Speech Processing (SLSP), Nov 2015, Budapest, Hungary. pp.229-241. ⟨hal-01181192⟩



Les métriques sont temporairement indisponibles