J. Yamagishi, Z. Ling, and S. King, Robustness of HMM-based speech synthesis, Proc. of Interspeech, pp.2-5, 2008.

H. Ze, A. Senior, and M. Schuster, Statistical parametric speech synthesis using deep neural networks, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7962-7966, 2013.

Y. Sagisaka, Speech synthesis by rule using an optimal selection of non-uniform synthesis units, Proc. of ICASSP. IEEE, pp.679-682, 1988.

A. W. Black and P. Taylor, CHATR: a generic speech synthesis system, Proc. of COLING, Association for Computational Linguistics, vol.2, pp.983-986, 1994.

A. Hunt and A. W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, Proc. of ICASSP, vol.1, pp.373-376, 1996.

P. Taylor, A. Black, and R. Caley, The architecture of the Festival speech synthesis system, Proc. of the ESCA Workshop in Speech Synthesis, pp.147-151, 1998.

A. P. Breen and P. Jackson, Non-uniform unit selection and the similarity metric within BT's Laureate TTS system, Proc. of the ESCA Workshop on Speech Synthesis, pp.373-376, 1998.

R. A. Clark, K. Richmond, and S. King, Multisyn: Open-domain unit selection for the Festival speech synthesis system, Speech Communication, vol.49, issue.4, pp.317-330, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00499177

H. Patil, T. Patel, N. Shah, H. Sailor, R. Krishnan et al., A syllable-based framework for unit selection synthesis in 13 Indian languages, Proc. of O-COCOSDA, pp.1-8, 2013.

Y. Stylianou and A. Syrdal, Perceptual and objective detection of discontinuities in concatenative speech synthesis, Proc. of ICASSP, vol.2, pp.837-840, 2001.

D. Tihelka, J. Matoušek, and Z. Hanzlíček, Modelling F0 Dynamics in Unit Selection Based Speech Synthesis, Proc. of TSD, pp.457-464, 2014.

P. Boersma, Praat, a system for doing phonetics by computer, Glot international, vol.5, issue.9, pp.341-345, 2002.

D. Talkin, A robust algorithm for pitch tracking (RAPT), in Speech Coding and Synthesis, pp.495-518, 1995.

J. Duddington, eSpeak text to speech, 2012.

S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw et al., The HTK Book (for HTK version 3.3), 2005.

J. Chevelu, G. Lecorvé, and D. Lolive, ROOTS: a toolkit for easy, fast and consistent processing of large sequential annotated data collections, Proc. of LREC, pp.619-626, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00974628

D. Guennec and D. Lolive, Unit Selection Cost Function Exploration Using an A* based Text-to-Speech System, Proc. of TSD, pp.432-440, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01133321

P. Alain, J. Chevelu, D. Guennec, G. Lecorvé, and D. Lolive, The IRISA Text-To-Speech System for the Blizzard Challenge, Blizzard Challenge 2016 workshop, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01662361

C. Blouin, O. Rosec, P. Bagshaw, and C. d'Alessandro, Concatenation cost calculation and optimisation for unit selection in TTS, IEEE Workshop on Speech Synthesis, pp.0-3, 2002.

F. Alías, L. Formiga, and X. Llorá, Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept, Speech Communication, vol.53, issue.5, pp.786-800, 2011.

A. Perquin, G. Lecorvé, D. Lolive, and L. Amsaleg, Phone-level embeddings for unit selection speech synthesis, Proc. of the International Conference on Statistical Language and Speech Processing (SLSP), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01840812

M. Morise, F. Yokomori, and K. Ozawa, WORLD: a vocoder-based high-quality speech synthesis system for real-time applications, IEICE Transactions on Information and Systems, vol.99, issue.7, pp.1877-1884, 2016.

M. Tahon, R. Qader, G. Lecorvé, and D. Lolive, Improving TTS with corpus-specific pronunciation adaptation, Proc. of Interspeech, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01338111