Duration modeling using DNN for Arabic speech synthesis

Imene Zangar; Zied Mnasri; Vincent Colotte; Denis Jouvet; Amal Houidhek

Communication Dans Un Congrès Année : 2018

Duration modeling using DNN for Arabic speech synthesis

(1) , (1) , (2) , (2) , (2)

1
2

Imene Zangar

Fonction : Auteur

Ecole Nationale d'Ingénieurs de Tunis

Zied Mnasri

Fonction : Auteur

Ecole Nationale d'Ingénieurs de Tunis

Vincent Colotte

Fonction : Auteur
PersonId : 16268
IdHAL : vincent-colotte
IdRef : 070401683

Speech Modeling for Facilitating Oral-Based Communication

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Speech Modeling for Facilitating Oral-Based Communication

Amal Houidhek

Fonction : Auteur

Speech Modeling for Facilitating Oral-Based Communication

Résumé

Duration modeling is a key task for every parametric speech synthesis system. Though such parametric systems have been adapted to many languages, no special attention was paid to explicitly handling Arabic speech characteristics. Actually, in Arabic phoneme duration has a distinctive role, because of consonant gemination and vowel quantity. Therefore, a precise modeling of sound durations is critical. In this paper we compare several modeling of phoneme durations (including duration modeling by HTS and MERLIN toolkits), and we propose a new approach which relies on using a set of models, each one being optimal for a given phoneme class (e.g., simple consonants, geminated consonants, short vowels, and long vowels). An objective evaluation carried out on a set of test sentences shows that the proposed approach leads to a more accurate modeling of the phoneme durations.

Mots clés

MERLIN phoneme duration modeling DNN Arabic TTS HTS

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

SP18_paper_78_version_final.pdf (239.9 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Denis Jouvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01889917

Soumis le : lundi 8 octobre 2018-10:45:02

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : mercredi 9 janvier 2019-13:52:04

Dates et versions

hal-01889917 , version 1 (08-10-2018)

Identifiants

HAL Id : hal-01889917 , version 1

Citer

Imene Zangar, Zied Mnasri, Vincent Colotte, Denis Jouvet, Amal Houidhek. Duration modeling using DNN for Arabic speech synthesis. 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland. ⟨hal-01889917⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

186 Consultations

529 Téléchargements

Duration modeling using DNN for Arabic speech synthesis

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager