Do not build your TTS training corpus randomly

Jonathan Chevelu; Damien Lolive

Communication Dans Un Congrès Année : 2015

Do not build your TTS training corpus randomly

(1) , (1)

Jonathan Chevelu

Fonction : Auteur
PersonId : 4560
IdHAL : jonathan-chevelu
IdRef : 156873885

Expressiveness in Human Centered Data/Media

Damien Lolive

Fonction : Auteur
PersonId : 5088
IdHAL : damien-lolive
ORCID : 0000-0002-1110-5444
IdRef : 13017498X

Expressiveness in Human Centered Data/Media

Résumé

TTS voice building generally relies on a script extracted from a big text corpus while optimizing the coverage of linguistic and phonological events supposedly related to voice acoustic quality. Previous works have shown differences on objective measures between smartly reduced and random corpora, but not when subjective evaluations are performed. For us, those results do not come from corpus reduction utility but from evaluations that smooth differences. In this article, we highlight those differences in a subjective test, by clustering test corpora according to a distance between signals so as to focus on different synthesized stimuli. The results show that covering appropriate features has a real impact on the perceived quality.

Mots clés

Corpus reduction Subjective evaluation Corpus-based Unit Selection TTS

Domaines

Intelligence artificielle [cs.AI] Traitement du signal et de l'image [eess.SP] Son [cs.SD] Interface homme-machine [cs.HC]

Damien Lolive : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01199083

Soumis le : lundi 14 septembre 2015-21:09:33

Dernière modification le : mardi 3 octobre 2023-09:49:30

Dates et versions

hal-01199083 , version 1 (14-09-2015)

Identifiants

HAL Id : hal-01199083 , version 1

Citer

Jonathan Chevelu, Damien Lolive. Do not build your TTS training corpus randomly. Proceedings of the European Signal Processing Conference (EUSIPCO), Aug 2015, Nice, France. ⟨hal-01199083⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES ENSSAT IRISA CENTRALESUPELEC IRISA-D6 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

324 Consultations

0 Téléchargements

Do not build your TTS training corpus randomly

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager