C. Olinsky and F. Cummins, Iterative English accent adaptation in a speech synthesis system, Proceedings of 2002 IEEE Workshop on Speech Synthesis, pp.79-82, 2002.
DOI : 10.1109/WSS.2002.1224377

D. Govind and S. M. Prasanna, Expressive speech synthesis: a review, International Journal of Speech Technology, vol.16, issue.2, pp.237-260, 2013.
DOI : 10.1007/s10772-012-9180-2

T. Fukada, T. Yoshimura, and Y. Sagisaka, Automatic generation of multiple pronunciations based on neural networks, Speech Communication, vol.27, issue.1, pp.63-73, 1999.
DOI : 10.1016/S0167-6393(98)00066-1

G. Tajchman, E. Fosler, and D. Jurafsky, Building multiple pronunciation models for novel words using exploratory computational phonology, European Conference on Speech Communication and Technology (Eurospeech), 1995.

P. Karanasou, F. Yvon, T. Lavergne, and L. Lamel, Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR, Annual Conference of the International Speech Communication Association (Interspeech), pp.1966-1970, 2013.

G. Lecorvé and D. Lolive, Adaptive statistical utterance phonetization for French, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4864-4868, 2015.
DOI : 10.1109/ICASSP.2015.7178895

T. J. Hazen, I. L. Hetherington, H. Shu, and K. Livescu, Pronunciation modeling using a finite-state transducer representation, Speech Communication, vol.46, issue.2, pp.189-203, 2005.
DOI : 10.1016/j.specom.2005.03.004

K. Livescu, P. Jyothi, and E. Fosler-Lussier, Articulatory feature-based pronunciation modeling, Computer Speech & Language, vol.36, pp.212-232, 2016.
DOI : 10.1016/j.csl.2015.07.003

B. Vazirnezhad, F. Almasganj, and S. Ahadi, Hybrid statistical pronunciation models designed to be trained by a medium-size corpus, Computer Speech & Language, vol.23, issue.1, pp.1-24, 2009.
DOI : 10.1016/j.csl.2008.02.001

A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand, M. Gregory et al., Effects of disfluencies, predictability, and utterance position on word form variation in English conversation, The Journal of the Acoustical Society of America, vol.113, issue.2, pp.1001-1024, 2003.
DOI : 10.1121/1.1534836

A. Bell, J. Brenier, M. Gregory, C. Girand, and D. Jurafsky, Predictability effects on durations of content and function words in conversational English, Journal of Memory and Language, vol.60, issue.1, pp.92-111, 2009.
DOI : 10.1016/j.jml.2008.06.003

M. Adda-Decker, P. Boula de Mareüil, G. Adda, and L. Lamel, Investigating syllabic structures and their variation in spontaneous French, Speech Communication, vol.46, issue.2, pp.119-139, 2005.
DOI : 10.1016/j.specom.2005.03.006

R. A. Bates, M. Ostendorf, and R. A. Wright, Symbolic phonetic features for modeling of pronunciation variation, Speech Communication, vol.49, issue.2, pp.83-97, 2007.
DOI : 10.1016/j.specom.2006.10.007

C. L. Bennett and A. W. Black, Prediction of Pronunciation Variations for Speech Synthesis: A Data-Driven Approach, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.297-300, 2005.
DOI : 10.1109/ICASSP.2005.1415109

K. Chen and M. Hasegawa-Johnson, Modeling pronunciation variation using artificial neural networks for English spontaneous speech, 8th International Conference on Spoken Language Processing (ICSLP), pp.4-8, 2004.

P. Boula de Mareüil and M. Adda-Decker, Studying pronunciation variants in French by using alignment techniques, International Conference on Spoken Language Processing, 2002.

R. Qader, G. Lecorvé, D. Lolive, and P. Sébillot, Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features, International Conference on Statistical Language and Speech Processing, 2015.
DOI : 10.1007/978-3-319-25789-1_22
URL : https://hal.archives-ouvertes.fr/hal-01181192

J. Chevelu, G. Lecorvé, and D. Lolive, ROOTS: a toolkit for easy, fast and consistent processing of large sequential annotated data collections, 9th International Language Resources and Evaluation Conference (LREC), European Language Resources Association (ELRA), 2014.
URL : https://hal.archives-ouvertes.fr/hal-00974628

F. Béchet, LIA-PHON: un système complet de phonétisation de texte, Traitement Automatique des Langues (TAL), vol.42, issue.1, pp.47-67, 2001.

Y. Lin, J. Michel, E. L. Aiden, J. Orwant, W. Brockman et al., Syntactic annotations for the Google Books Ngram Corpus, 50th Annual Meeting of the Association for Computational Linguistics, pp.169-174, 2012.

C. d'Alessandro, S. Rosset, and J. Rossi, The pitch of short-duration fundamental frequency glissandos, The Journal of the Acoustical Society of America, vol.104, issue.4, pp.2339-2348, 1998.
DOI : 10.1121/1.423745

T. Lavergne, O. Cappé, and F. Yvon, Practical very large scale CRFs, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pp.504-513, 2010.

I. Guyon and A. Elisseeff, An introduction to variable and feature selection, Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

D. Guennec and D. Lolive, Unit Selection Cost Function Exploration Using an A* Based Text-to-Speech System, 17th International Conference on Text, Speech and Dialogue, pp.449-457, 2014.
DOI : 10.1007/978-3-319-10816-2_52
URL : https://hal.archives-ouvertes.fr/hal-01133321

H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko et al., The HMM-based speech synthesis system (HTS) version 2.0, Speech Synthesis Workshop (SSW), pp.294-299, 2007.