Skip to Main content Skip to Navigation
Conference papers

Layer adaptation for transfer of expressivity in speech synthesis

Ajinkya Kulkarni 1 Vincent Colotte 1 Denis Jouvet 1
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Expressive speech synthesis using parametric approaches is constrained by the style of the speech corpus used. In this paper, we present the development of an expressive speech synthesis for a new speaker voice without requiring a specific recording of expressive speech by new speaker. We propose deep neural network based layer adaptation framework for transferring the expressive characteristics of speech to a new speaker's voice for which only neutral speech data is available. The focus of the work is on investigating transfer learning mechanism, which will accelerate the efforts towards exploiting existing expressive speech corpus. Experiments using expressive Caroline speech corpus and neutral Lisa speech corpus shows layer adaptation technique is able to transfer expressive characteristics while keeping the speaker's style characteristics.
Complete list of metadatas

Cited literature [24 references]  Display  Hide  Download

https://hal.inria.fr/hal-02177945
Contributor : Vincent Colotte <>
Submitted on : Tuesday, July 9, 2019 - 2:15:08 PM
Last modification on : Wednesday, July 24, 2019 - 10:55:10 AM

File

LTC19.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02177945, version 1

Collections

Citation

Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet. Layer adaptation for transfer of expressivity in speech synthesis. LTC'19 - 9th Language & Technology Conference, May 2019, Poznan, Poland. ⟨hal-02177945⟩

Share

Metrics

Record views

118

Files downloads

491