Conference paper. Year: 2022

Voice Cloning Applied to Voice Disorders: a Study of Extreme Phonetic Content in Speaker Embeddings

Abstract

Organic dysphonia can lead to vocal impairments. Recording patients' impaired voices could allow them to use voice cloning systems. In speech synthesis, voice cloning is the process of producing speech that matches a target speaker's voice, given textual input and an audio sample from that speaker. It can achieve high-quality speech with only a small amount of data from the target speaker. However, dysphonic patients may only be able to produce speech with specific or limited phonetic content. To our knowledge, the impact of such constraints on a voice cloning system remains to be studied. This article presents the results of preliminary experiments on the matter, along with specifications of the models and datasets used.
File not deposited

Dates and versions

hal-03697484, version 1 (16-06-2022)

Identifiers

  • HAL Id: hal-03697484, version 1

Cite

Lily Wadoux, Nelly Barbot, Damien Lolive, Jonathan Chevelu. Voice Cloning Applied to Voice Disorders: a Study of Extreme Phonetic Content in Speaker Embeddings. 35th Canadian Conference on Artificial Intelligence, May 2022, Toronto, Canada. ⟨hal-03697484⟩