Skip to Main content Skip to Navigation
New interface
Conference papers

Layer adaptation for transfer of expressivity in speech synthesis

Ajinkya Kulkarni 1 Vincent Colotte 1 Denis Jouvet 1 
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Expressive speech synthesis using parametric approaches is constrained by the style of the speech corpus used. In this paper, we present the development of an expressive speech synthesis for a new speaker voice without requiring a specific recording of expressive speech by new speaker. We propose deep neural network based layer adaptation framework for transferring the expressive characteristics of speech to a new speaker's voice for which only neutral speech data is available. The focus of the work is on investigating transfer learning mechanism, which will accelerate the efforts towards exploiting existing expressive speech corpus. Experiments using expressive Caroline speech corpus and neutral Lisa speech corpus shows layer adaptation technique is able to transfer expressive characteristics while keeping the speaker's style characteristics.
Complete list of metadata

Cited literature [24 references]  Display  Hide  Download
Contributor : Vincent Colotte Connect in order to contact the contributor
Submitted on : Tuesday, July 9, 2019 - 2:15:08 PM
Last modification on : Saturday, June 25, 2022 - 7:40:42 PM


Files produced by the author(s)


  • HAL Id : hal-02177945, version 1


Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet. Layer adaptation for transfer of expressivity in speech synthesis. LTC'19 - 9th Language & Technology Conference, May 2019, Poznan, Poland. ⟨hal-02177945⟩



Record views


Files downloads