Skip to Main content Skip to Navigation
New interface
Conference papers

Do not build your TTS training corpus randomly

Jonathan Chevelu 1 Damien Lolive 1 
1 EXPRESSION - Expressiveness in Human Centered Data/Media
UBS - Université de Bretagne Sud, IRISA-D6 - MEDIA ET INTERACTIONS
Abstract : TTS voice building generally relies on a script extracted from a big text corpus while optimizing the coverage of linguistic and phonological events supposedly related to voice acoustic quality. Previous works have shown differences on objective measures between smartly reduced and random corpora, but not when subjective evaluations are performed. For us, those results do not come from corpus reduction utility but from evaluations that smooth differences. In this article, we highlight those differences in a subjective test, by clustering test corpora according to a distance between signals so as to focus on different synthesized stimuli. The results show that covering appropriate features has a real impact on the perceived quality.
Complete list of metadata
Contributor : Damien Lolive Connect in order to contact the contributor
Submitted on : Monday, September 14, 2015 - 9:09:33 PM
Last modification on : Tuesday, October 19, 2021 - 11:58:58 PM


  • HAL Id : hal-01199083, version 1


Jonathan Chevelu, Damien Lolive. Do not build your TTS training corpus randomly. Proceedings of the European Signal Processing Conference (EUSIPCO), Aug 2015, Nice, France. ⟨hal-01199083⟩



Record views