Skip to Main content Skip to Navigation
Conference papers

Duration modeling using DNN for Arabic speech synthesis

Imene Zangar 1 Zied Mnasri 1 Vincent Colotte 2 Denis Jouvet 2 Amal Houidhek 2 
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Duration modeling is a key task for every parametric speech synthesis system. Though such parametric systems have been adapted to many languages, no special attention was paid to explicitly handling Arabic speech characteristics. Actually, in Arabic phoneme duration has a distinctive role, because of consonant gemination and vowel quantity. Therefore, a precise modeling of sound durations is critical. In this paper we compare several modeling of phoneme durations (including duration modeling by HTS and MERLIN toolkits), and we propose a new approach which relies on using a set of models, each one being optimal for a given phoneme class (e.g., simple consonants, geminated consonants, short vowels, and long vowels). An objective evaluation carried out on a set of test sentences shows that the proposed approach leads to a more accurate modeling of the phoneme durations.
Document type :
Conference papers
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download
Contributor : Denis Jouvet Connect in order to contact the contributor
Submitted on : Monday, October 8, 2018 - 10:45:02 AM
Last modification on : Saturday, June 25, 2022 - 7:43:39 PM
Long-term archiving on: : Wednesday, January 9, 2019 - 1:52:04 PM


Files produced by the author(s)


  • HAL Id : hal-01889917, version 1


Imene Zangar, Zied Mnasri, Vincent Colotte, Denis Jouvet, Amal Houidhek. Duration modeling using DNN for Arabic speech synthesis. 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland. ⟨hal-01889917⟩



Record views


Files downloads