Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept

Raheel Qader 1 Gwénolé Lecorvé 1 Damien Lolive 1 Pascale Sébillot 2
1 EXPRESSION - Expressiveness in Human Centered Data/Media
UBS - Université de Bretagne Sud, IRISA-D6 - MEDIA ET INTERACTIONS
2 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique , IRISA_D6 - MEDIA ET INTERACTIONS
Abstract : This paper presents an exploratory work to automatically insert disfluencies in text-to-speech (TTS) systems. The objective is to make TTS more spontaneous and expressive. To achieve this, we propose to focus on the linguistic level of speech through the insertion of pauses, repetitions and revisions. We formalize the problem as a theoretical process, where transformations are iteratively composed. This is a novel contribution since most of the previous work either focus on the detection or cleaning of linguistic disfluencies in speech transcripts, or solely concentrate on acoustic phenomena in TTS, especially pauses. We present a first implementation of the proposed process using conditional random fields and language models. The objective and perceptual evalation conducted on an English corpus of spontaneous speech show that our proposition is effective to generate disfluencies, and highlights perspectives for future improvements.
Type de document :
Communication dans un congrès
SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing, Oct 2018, Mons, Belgium. pp.1-12
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01840798
Contributeur : Gwénolé Lecorvé <>
Soumis le : lundi 16 juillet 2018 - 16:44:18
Dernière modification le : lundi 15 octobre 2018 - 13:40:04
Document(s) archivé(s) le : mercredi 17 octobre 2018 - 15:56:26

Fichier

disfluencies_slsp_camera_ready...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01840798, version 1

Citation

Raheel Qader, Gwénolé Lecorvé, Damien Lolive, Pascale Sébillot. Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept. SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing, Oct 2018, Mons, Belgium. pp.1-12. 〈hal-01840798〉

Partager

Métriques

Consultations de la notice

222

Téléchargements de fichiers

36