Statistical modelling of speech units in HMM-based speech synthesis for Arabic

Amal Houidhek 1, 2 Vincent Colotte 1 Zied Mnasri 2 Denis Jouvet 1 Imene Zangar 2
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper investigates statistical parametric speech synthesis of Modern Standard Arabic (MSA). Hidden Markov Models (HMM)-based speech synthesis system relies on a description of speech segments corresponding to phonemes, with a large set of features that represent phonetic, phonologic, linguistic and contextual aspects. When applied to MSA two specific phenomena have to be taken in account, the vowel lengthening and the consonant gemination. This paper studies thoroughly the modeling of these phenomena through various approaches: as for example, the use of different units for modeling short vs. long vowels and the use of different units for modeling simple vs. geminated consonants. These approaches are compared to another one which merges short and long variants of a vowel into a single unit and, simple and geminated variants of a consonant into a single unit (these characteristics being handled through the features associated to the sound). Results of subjective evaluation show that there is no significant difference between using the same unit for simple and geminated consonant (as well as for short and long vowels) and using different units for simple vs. geminated consonants (as well for short vs. long vowels).
Type de document :
Communication dans un congrès
LTC 2017 - 8th Language & Technology Conference, Nov 2017, Poznań, Poland. pp.1-5, 2017
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01649034
Contributeur : Denis Jouvet <>
Soumis le : lundi 27 novembre 2017 - 10:35:59
Dernière modification le : jeudi 11 janvier 2018 - 06:27:31

Fichier

ltc-27-houidhek--final-version...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01649034, version 1

Citation

Amal Houidhek, Vincent Colotte, Zied Mnasri, Denis Jouvet, Imene Zangar. Statistical modelling of speech units in HMM-based speech synthesis for Arabic. LTC 2017 - 8th Language & Technology Conference, Nov 2017, Poznań, Poland. pp.1-5, 2017. 〈hal-01649034〉

Partager

Métriques

Consultations de la notice

339

Téléchargements de fichiers

59