E. Moulines and F. Charpentier, Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Communication, vol.9, issue.5-6, pp.453-467, 1990.

A. J. Hunt and A. W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, ICASSP 1996 - 21st IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.373-376, 1996.

A. W. Black and K. Lenzo, Optimal utterance selection for unit selection speech synthesis databases, International Journal of Speech Technology, vol.6, issue.4, pp.357-363, 2003.

S. J. Young, The HTK HMM toolkit: Design and philosophy, Cambridge Univ. Eng. Dept. Tech. Rpt. CUED/F-INFENG/TR, vol.152, 1993.

T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, Proceedings of the EUROSPEECH, vol.5, pp.2347-2350, 1999.

J. Yamagishi, T. Nose, H. Zen, Z. H. Ling, T. Toda et al., Robust speaker-adaptive HMM-based text-to-speech synthesis, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.6, pp.1208-1230, 2009.

H. Zen, K. Tokuda, and A. W. Black, Statistical parametric speech synthesis, Speech Communication, vol.51, issue.11, pp.1039-1064, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00746106

H. Zen, A. Senior, and M. Schuster, Statistical parametric speech synthesis using deep neural networks, ICASSP 2013 - 38th IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7962-7966, 2013.

Z. Wu, O. Watts, and S. King, Merlin: An open source neural network speech synthesis system, 9th ISCA Workshop on Speech Synthesis, 2016.

A. V. Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals et al., Wavenet: A generative model for raw audio, 2016.

R. Abdelmalek and Z. Mnasri, High quality Arabic text-to-speech synthesis using unit selection, SSD 2016 - 13th IEEE International Multi-Conference on Systems, Signals and Devices, pp.1-5, 2016.

O. Abdel-hamid, S. Abdou, and M. Rashwan, Improving Arabic HMM based speech synthesis quality, INTERSPEECH 2006 - 9th Annual Conference of the International Speech Communication Association, pp.1332-1335, 2006.

A. Houidhek, V. Colotte, Z. Mnasri, D. Jouvet, and I. Zangar, Statistical modelling of speech units in HMM-based speech synthesis for Arabic, LTC 2017 - 8th Language & Technology Conference, pp.1-5, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01649034

D. Newman, The phonetics of Arabic, Journal of the American Oriental Society, vol.46, pp.1-6, 1984.

S. Baloul, Développement d'un système automatique de synthèse de la parole à partir du texte arabe standard voyellé. Le Mans : Doctoral dissertation, 2003.

H. Zen and H. Sak, Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis, ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4470-4474, 2015.
DOI : 10.1109/icassp.2015.7178816
URL : http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/43266.pdf

Z. Mnasri, F. Boukadida, and N. Ellouze, Modeling Segmental Duration by Statistical Learning for an Arabic Text-to-Speech System, International Review on Computers and Software, vol.4, issue.5, 2009.

D. H. Klatt, Linguistic uses of segmental duration in English: Acoustic and perceptual evidence, The Journal

J. P. Van-santen, Prosodic modelling in text-to-speech synthesis, Eurospeech 1997 - 5th European Conference on Speech Communication and Technology, pp.19-28, 1997.

T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Duration modeling in HMM-based speech synthesis system, Proceedings of ICSLP, vol.2, pp.29-32, 1998.

W. N. Campbell, Syllable-based segmental duration, Talking machines: Theories, models, and designs, 1992.

H. Mixdorff and O. Jockisch, Building an integrated prosodic model of German, Speech Communication and Technology, pp.947-950, 2001.

G. E. Henter, S. Ronanki, O. Watts, M. Wester, Z. Wu et al., Robust TTS duration modelling using DNNs, ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5130-5134, 2016.
DOI : 10.1109/icassp.2016.7472655
URL : https://www.pure.ed.ac.uk/ws/files/23618761/henter2016robust_final_1.pdf

B. Chen, B. Tianling, and Y. Kai, Discrete duration model for speech synthesis, INTERSPEECH 2017 - 18th Annual Conference of the International Speech Communication Association, pp.789-793, 2017.
DOI : 10.21437/interspeech.2017-1144

K. M. Rosen, Analysis of speech segmental duration with the lognormal distribution: A basis for unification and comparison, Journal of Phonetics, vol.33, issue.4, pp.411-426, 2005.

N. Halabi and W. Wald, Phonetic inventory for an Arabic speech corpus, LREC 2016 - 10th International Conference on Language Resources and Evaluation, pp.734-738, 2016.