How to Compare TTS Systems: A New Subjective Evaluation Methodology Focused on Differences

Jonathan Chevelu; Damien Lolive; Sébastien Le Maguer; David Guennec

Communication Dans Un Congrès Année : 2015

How to Compare TTS Systems: A New Subjective Evaluation Methodology Focused on Differences

(1) , (1) , (2) , (1)

1
2

Jonathan Chevelu

Fonction : Auteur
PersonId : 4560
IdHAL : jonathan-chevelu
IdRef : 156873885

Expressiveness in Human Centered Data/Media

Damien Lolive

Fonction : Auteur
PersonId : 5088
IdHAL : damien-lolive
ORCID : 0000-0002-1110-5444
IdRef : 13017498X

Expressiveness in Human Centered Data/Media

Sébastien Le Maguer

Fonction : Auteur

Saarland University [Saarbrücken]

David Guennec

Fonction : Auteur
PersonId : 955117
IdHAL : 197707475
ORCID : 0009-0006-3265-6321

Expressiveness in Human Centered Data/Media

Résumé

Subjective evaluation is a crucial problem in the speech processing community and especially for the speech synthesis field, no matter what system is used. Indeed, when trying to assess the effectiveness of a proposed method, researchers usually conduct subjective evaluations by randomly choosing a small set of samples, from the same domain, taken from a baseline system and the proposed one. When selecting them randomly, statistically, samples with almost no differences are evaluated and the global measure is smoothed which may lead to judge the improvement not significant. To solve this methodological flaw, we propose to compare speech synthesis systems on thousands of generated samples from various domains and to focus subjective evaluations on the most relevant ones by computing a normalized alignment cost between sample pairs. This process has been successfully applied both in the HTS statistical framework and in the corpusbased approach. We have conducted two perceptive experiments by generating more than 27,000 samples for each system under comparison. A comparison between tests involving most different samples and randomly chosen samples shows clearly that the proposed approach

Mots clés

speech synthesis subjective evaluation

Domaines

Interface homme-machine [cs.HC] Intelligence artificielle [cs.AI] Traitement du signal et de l'image [eess.SP] Son [cs.SD]

Damien Lolive : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01199082

Soumis le : lundi 14 septembre 2015-21:06:16

Dernière modification le : mardi 3 octobre 2023-09:49:09

Dates et versions

hal-01199082 , version 1 (14-09-2015)

Identifiants

HAL Id : hal-01199082 , version 1

Citer

Jonathan Chevelu, Damien Lolive, Sébastien Le Maguer, David Guennec. How to Compare TTS Systems: A New Subjective Evaluation Methodology Focused on Differences. Interspeech, Sep 2015, Dresden, Germany. ⟨hal-01199082⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES ENSSAT IRISA CENTRALESUPELEC IRISA-D6 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

534 Consultations

0 Téléchargements

How to Compare TTS Systems: A New Subjective Evaluation Methodology Focused on Differences

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager