D. Lolive, N. Barbot, and O. Boëffard, B-spline model order selection with optimal MDL criterion applied to speech fundamental frequency stylisation, IEEE Journal of Selected Topics in Signal Processing, vol.4, issue.3, pp.571-581, 2010.
DOI : 10.1109/jstsp.2010.2048236
URL : https://hal.archives-ouvertes.fr/inria-00538937

H. Yoo, E. Delais-roussarie, D. Lolive, and N. Barbot, Le Rythme en Lecture Oralisée (parole synthétique et parole naturelle), Revue Française de Linguistique Appliquée, vol.XX, issue.2, pp.63-77, 2015.

N. Barbot, O. Boëffard, and D. Lolive, F0 stylisation with a free-knot b-spline model and simulated-annealing optimization, Proceedings of the 9th European Conference on Speech Communication and Technology (Eurospeech), 2005.
URL : https://hal.archives-ouvertes.fr/hal-01199085

D. Lolive, N. Barbot, and O. Boëffard, Comparing B-Spline and Spline Models for F0 Modelling, Lecture Notes in Artificial Intelligence - Proceedings of the 9th International Conference on Text, Speech and Dialogue, 2006.
DOI : 10.1007/11846406_53
URL : https://hal.archives-ouvertes.fr/hal-01199086

D. Lolive, N. Barbot, and O. Boëffard, Melodic contour estimation with b-spline models using a MDL criterion, Proceedings of the 11th International Conference on Speech and Computer (SPECOM). Saint, 2006.
URL : https://hal.archives-ouvertes.fr/hal-01199087

D. Lolive, N. Barbot, and O. Boëffard, Clustering algorithm for f0 curves based on hidden markov models, Proceedings of the 6th ISCA Tutorial and Research Workshop on Speech Synthesis, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01199089

D. Lolive, N. Barbot, and O. Boëffard, Unsupervised HMM classification of f0 curves, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01199088

D. Lolive, N. Barbot, and O. Boëffard, Pitch and duration transformation with non parallel data, 4th conference of Speech Prosody, pp.111-114, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00987810

D. Lolive, N. Barbot, and O. Boëffard, An evaluation methodology for prosody transformation systems based on chirp signals, Interspeech, Brighton, United Kingdom, pp.2635-2638, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00976430

N. Barbot, V. Barreaud, O. Boëffard, L. Charonnat, A. Delhay et al., Towards a Versatile Multi-Layered Description of Speech Corpora Using Algebraic Relations, Conference of the International Speech Communication Association (Interspeech), pp.1501-1504, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00657283

O. Boëffard, L. Charonnat, S. Le Maguer, and D. Lolive, Towards Fully Automatic Annotation of Audio Books for TTS, LREC - Eighth International Conference on Language Resources and Evaluation, 2012.

M. Avanzi, G. Christodoulides, E. Delais-roussarie, N. Barbot et al., Towards the Adaptation of Prosodic Models for Expressive Text-To-Speech Synthesis, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01133316

J. Chevelu, G. Lecorvé, and D. Lolive, ROOTS : a toolkit for easy, fast and consistent processing of large sequential annotated data collections, Language Resources and Evaluation Conference (LREC), 2014.
URL : https://hal.archives-ouvertes.fr/hal-00974628

E. Delais-roussarie, D. Lolive, H. Yoo, N. Barbot, and O. Rosec, Adapting prosodic chunking algorithm and synthesis system to specific style, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01133319

D. Guennec and D. Lolive, Unit Selection Cost Function Exploration Using an A* Based Text-to-Speech System, International Conference on Text, Speech and Dialogue (TSD), 2014.
DOI : 10.1007/978-3-319-10816-2_52
URL : https://hal.archives-ouvertes.fr/hal-01133321

S. Le Maguer, E. Delais-roussarie, N. Barbot, M. Avanzi et al., Prosodic chunking algorithm for dictation with the use of speech synthesis, Proc. of Speech Prosody, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00973866

P. Alain, J. Chevelu, D. Guennec, G. Lecorvé, and D. Lolive, The IRISA Text-To-Speech System for the Blizzard Challenge, Blizzard Challenge 2015 Workshop, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01375897

J. Chevelu and D. Lolive, Do not build your TTS training corpus randomly, 2015 23rd European Signal Processing Conference (EUSIPCO), 2015.
DOI : 10.1109/EUSIPCO.2015.7362403
URL : https://hal.archives-ouvertes.fr/hal-01199083

J. Chevelu, D. Lolive, S. Le Maguer, and D. Guennec, How to Compare TTS Systems : A New Subjective Evaluation Methodology Focused on Differences, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01199082

D. Guennec, J. Chevelu, and D. Lolive, Defining a Global Adaptive Duration Target Cost for Unit Selection Speech Synthesis, Proceedings of the International Conference on Text, Speech and Dialogue (TSD), Plzeň, Czech Republic, pp.157-165, 2015.
DOI : 10.1007/978-3-319-10816-2_52
URL : https://hal.archives-ouvertes.fr/hal-01188686

G. Lecorvé and D. Lolive, Adaptive statistical utterance phonetization for French, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015.
DOI : 10.1109/ICASSP.2015.7178895

R. Qader, G. Lecorvé, D. Lolive, and P. Sébillot, Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features, International Conference on Statistical Language and Speech Processing (SLSP), pp.229-241, 2015.
DOI : 10.1109/LSP.2010.2098440
URL : https://hal.archives-ouvertes.fr/hal-01181192

P. Alain, J. Chevelu, D. Guennec, G. Lecorvé, and D. Lolive, The IRISA Text-To-Speech System for the Blizzard Challenge, Blizzard Challenge 2016 workshop. Cupertino, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01375897

C. Fayet, A. Delhay, D. Lolive, and P. Marteau, Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus, Interspeech 2017, 2017.
DOI : 10.21437/Interspeech.2017-1194

C. Fayet, A. Delhay, D. Lolive, and P. Marteau, First Experiments to Detect Anomaly Using Personality Traits vs. Prosodic Features, 2016.
DOI : 10.1109/TAFFC.2014.2330816

D. Guennec and D. Lolive, On the Suitability of Vocalic Sandwiches in a Corpus-Based TTS Engine, Interspeech 2016, 2016.
DOI : 10.21437/Interspeech.2016-1222
URL : https://hal.archives-ouvertes.fr/hal-01338839

M. Tahon, R. Qader, G. Lecorvé, and D. Lolive, Improving TTS with Corpus-Specific Pronunciation Adaptation, Interspeech 2016, 2016.
DOI : 10.21437/Interspeech.2016-864
URL : https://hal.archives-ouvertes.fr/hal-01338111

M. Tahon, R. Qader, G. Lecorvé, and D. Lolive, Optimal Feature Set and Minimal Training Size for Pronunciation Adaptation in TTS, International Conference on Statistical Language and Speech Processing (SLSP), 2016.
DOI : 10.21437/Interspeech.2016-864
URL : https://hal.archives-ouvertes.fr/hal-01338853

D. Lolive, N. Barbot, and O. Boëffard, Modélisation b-spline de contours mélodiques avec estimation du nombre de paramètres libres par un critère MDL, 2006.

D. Lolive, N. Barbot, and O. Boëffard, Proposition d'un critère MDL pour l'estimation de courbes ouvertes modélisées par des b-splines, Actes de la 8ème Conférence Francophone sur l'Apprentissage Automatique, 2006.

D. Lolive, N. Barbot, and O. Boëffard, Transformation de la prosodie par adaptation MLLR de GMM, Actes des XXVIIèmes Journées d'Etudes sur la Parole, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01199092

R. Qader, G. Lecorvé, D. Lolive, and P. Sébillot, Phonology Modelling for Expressive Speech Synthesis : a Review, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01021911

D. Lolive, Prosody transformation : application to speech synthesis and voice transformation, PhD thesis, Université de Rennes 1, 2008.
URL : https://hal.archives-ouvertes.fr/tel-01199093

F. de Saussure, Cours de linguistique générale, original edition: 1916, 1979 edition: Payot, 1916.

B. Lindblom, Spectrographic study of vowel reduction, The Journal of the Acoustical Society of America, vol.35, 1963.
DOI : 10.1121/1.2142410

A. J. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, vol.13, issue.2, pp.260-269, 1967.
DOI : 10.1109/tit.1967.1054010

P. E. Hart, N. J. Nilsson, and B. Raphael, A Formal Basis for the Heuristic Determination of Minimum Cost Paths, IEEE Transactions on Systems Science and Cybernetics, vol.4, issue.2, 1968.
DOI : 10.1109/TSSC.1968.300136

H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.26, issue.1, pp.43-49, 1978.
DOI : 10.1109/TASSP.1978.1163055

I. Fónagy, L'accent français : accent probabilitaire (dynamique d'un changement prosodique), Studia Phonetica Montréal, vol.15, pp.123-233, 1980.

N. J. Nilsson, Principles of Artificial Intelligence, 1982.
DOI : 10.1007/978-3-662-09438-9

P. Verluyten, Recherches sur la prosodie et la métrique du français, PhD thesis, 1982.

J. E. Cahn, The Generation of Affect in Synthesized Speech, Journal of American Voice I/O Society, vol.8, pp.1-19, 1990.

E. Moulines and F. Charpentier, Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Communication, vol.9, issue.5-6, 1990.
DOI : 10.1016/0167-6393(90)90021-z

R. Carlson, Synthesis: Modeling variability and constraints, Speech Communication, vol.11, issue.2-3, pp.159-166, 1992.
DOI : 10.1016/0167-6393(92)90010-5
URL : http://www.speech.kth.se/prod/publications/files/qpsr/1991/1991_32_4_001-009.pdf

A. W. Black and P. Taylor, CHATR : a generic speech synthesis system, 15th conference on Computational linguistics, pp.983-986, 1994.

M. Riedi, A neural-network-based model of segmental duration for speech synthesis, Proceedings of Eurospeech, 1995.

E. Delais-roussarie, Phonological phrasing and accentuation in French, in Dam Phonology : HIL phonology papers II, Den Haag : Holland Academic Graphics, pp.1-38, 1996.

A. J. Hunt and A. W. Black, Unit selection in a concatenative speech synthesis system using a large speech database, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.373-376, 1996.
DOI : 10.1109/ICASSP.1996.541110

I. R. Murray and J. L. Arnott, Synthesizing emotions in speech: is it time to get excited?, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.1816-1819, 1996.
DOI : 10.1109/ICSLP.1996.607983

A. P. Breen and P. Jackson, Non-uniform unit selection and the similarity metric within BT's Laureate TTS system, The Third ESCA/COCOSDA Workshop (ETRW) on Speech Synthesis, 1998.

O. Karaali, G. Corrigan, I. Gerson, and N. Massey, Text-to-speech conversion with neural networks : a recurrent TDNN approach, Proceedings of Interspeech, 1998.

P. Taylor, A. W. Black, and R. Caley, The architecture of the Festival speech synthesis system, Proc. of the ESCA Workshop in Speech Synthesis, pp.147-151, 1998.

J. R. Yi, Natural-sounding speech synthesis using variable-length units, 1998.

A. Di Cristo, Le cadre accentuel du français contemporain : essai de modélisation, Première partie, Langues, vol.2, issue.3, pp.184-205, 1999.

E. Fosler-lussier, Multi-level decision trees for static and dynamic pronunciation models, Proceedings of the European Conference on Speech Communication and Technology (Eurospeech), 1999.

F. Schiel, Automatic phonetic transcription of non-prompted speech, Proceedings of the International Congresses of Phonetic Sciences, pp.607-610, 1999.

T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, Proceedings of Eurospeech, pp.2347-2350, 1999.

J. Carrier, L'école et le multimédia, Centre National de Documentation Pédagogique, Hachette Éducation, 2000.

A. Conkie, M. C. Beutnagel, A. K. Syrdal, P. E. Brown et al., Preselection of candidate units in a unit selection-based text-to-speech synthesis, 2000.

B. Post, Tonal and phrasal structures in French intonation, 2000.

K. Sjölander and J. Beskow, Wavesurfer - an open source speech tool, Proceedings of Interspeech, pp.464-467, 2000.

K. Toutanova and C. D. Manning, Enriching the knowledge sources used in a maximum entropy part-of-speech tagger, Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, pp.63-70, 2000.
DOI : 10.3115/1117794.1117802

C. Barras, E. Geoffrois, Z. Wu, and M. Liberman, Transcriber: Development and use of a tool for assisting speech corpora production, Speech Communication, vol.33, issue.1-2, 2001.
DOI : 10.1016/S0167-6393(00)00067-4

R. E. Donovan, A new distance measure for costing spectral discontinuities in concatenative speech synthesizers, 2001.

J. Lafferty, A. McCallum, and F. C. N. Pereira, Conditional random fields : probabilistic models for segmenting and labeling sequence data, 2001.

M. Schröder, Emotional Speech Synthesis : A Review, Proc of Eurospeech, pp.561-564, 2001.

Y. Stylianou and A. K. Syrdal, Perceptual and objective detection of discontinuities in concatenative speech synthesis, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.837-840, 2001.
DOI : 10.1109/ICASSP.2001.941045

P. Taylor, A. W. Black, and R. Caley, Heterogeneous relation graphs as a formalism for representing linguistic information, Speech Communication, vol.33, issue.1-2, 2001.
DOI : 10.1016/S0167-6393(00)00074-1

R. Bates and M. Ostendorf, Modeling pronunciation variation in conversational speech using prosody, ISCA Tutorial and Research Workshop (ITRW) on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology, 2002.

A. W. Black, P. Taylor, R. Caley, and R. Clark, The Festival speech synthesis system, 2002.

P. Boersma, Praat, a system for doing phonetics by computer, pp.341-345, 2002.

H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan, GATE, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.168-175, 2002.
DOI : 10.3115/1073083.1073112

A. Bell, D. Jurafsky, E. Fosler-lussier, C. Girand, M. Gregory et al., Effects of disfluencies, predictability, and utterance position on word form variation in English conversation, The Journal of the Acoustical Society of America, vol.113, issue.2, 2003.
DOI : 10.1121/1.1534836

I. Guyon and A. Elisseeff, An introduction to variable and feature selection, Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

R. K. Moore, A comparison of the data requirements of automatic speech recognition systems and human listeners, 2003.

K. Chen and M. Hasegawa-johnson, Modeling pronunciation variation using artificial neural networks for English spontaneous speech, Proceedings of the Annual Conference of the International Speech Communication Association, 2004.

D. Ferrucci and A. Lally, UIMA: an architectural approach to unstructured information processing in the corporate research environment, Natural Language Engineering, vol.10, issue.3-4, 2004.
DOI : 10.1017/S1351324904003523

R. Kumar, A genetic algorithm for unit selection based speech synthesis, Eighth International Conference on Spoken Language Processing, 2004.

M. Adda-decker, P. Boula-de-mareüil, G. Adda, and L. Lamel, Investigating syllabic structures and their variation in spontaneous French, Speech Communication, vol.46, 2005.
DOI : 10.1016/j.specom.2005.03.006

J. Carletta, S. Evert, U. Heid, and J. Kilgour, The NITE XML Toolkit: Data Model and Query Language, Language Resources and Evaluation, vol.39, pp.313-334, 2005.
DOI : 10.3758/BF03200802

N. Marty, Informatique et nouvelles pratiques d'écriture, 2005.

M. A. Pitt, K. Johnson, E. Hume, S. Kiesling, and W. Raymond, The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability, Speech Communication, vol.45, issue.1, 2005.
DOI : 10.1016/j.specom.2004.09.001

K. R. Scherer, What are emotions? And how can they be measured?, Social Science Information, vol.44, issue.4, pp.695-729, 2005.
DOI : 10.1017/CBO9780511521256

M. Viswanathan and M. Viswanathan, Measuring speech quality for text-to-speech systems: development and assessment of a modified mean opinion score (MOS) scale, Computer Speech & Language, vol.19, issue.1, 2005.
DOI : 10.1016/j.csl.2003.12.001

D. Ferrucci, A. Lally, D. Gruhl, E. Epstein, M. Schor et al., Towards an interoperability standard for text and multi-modal analytics, IBM Research Report, 2006.

M. Garcia, C. d'Alessandro, and G. Bailly, A joint prosody evaluation of French text-to-speech synthesis systems, Proc. of LREC, 2006.

K. Prahallad, A. W. Black, and R. Mosur, Sub-Phonetic Modeling For Capturing Pronunciation Variations For Conversational Speech Synthesis, 2006 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1660155

R. A. Clark, K. Richmond, and S. King, Multisyn: Open-domain unit selection for the Festival speech synthesis system, Speech Communication, vol.49, pp.317-330, 2007.
DOI : 10.1016/j.specom.2007.01.014
URL : https://hal.archives-ouvertes.fr/hal-00499177

T. Lambert, N. Braunschweiler, and S. Buchholz, How (not) to select your voice corpus : Random selection vs. phonologically balanced, Proc. of SSW6, 2007.

H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko et al., The HMM-based speech synthesis system (HTS) version 2, Speech Synthesis Workshop (SSW), pp.294-299, 2007.

J. Chevelu, N. Barbot, O. Boëffard, and A. Delhay, Comparing Set-Covering Strategies for Optimal Corpus Design, 2008.

O. Goubanova and S. King, Bayesian networks for phone duration prediction, Speech Communication, vol.50, pp.301-311, 2008.
DOI : 10.1016/j.specom.2007.10.002
URL : https://hal.archives-ouvertes.fr/hal-00499198

J. Yamagishi, Z. Ling, and S. King, Robustness of HMM-based speech synthesis, Science And Technology, pp.2-5, 2008.

A. Bell, J. M. Brenier, M. Gregory, C. Girand, and D. Jurafsky, Predictability effects on durations of content and function words in conversational English, Journal of Memory and Language, vol.60, issue.1, 2009.
DOI : 10.1016/j.jml.2008.06.003

D. Cadic, C. Boidin, and C. d'Alessandro, Vocalic sandwich, a unit designed for unit selection TTS, Tenth Annual Conference of the International Speech Communication Association, pp.2079-2082, 2009.

M. Eskenazi, An overview of spoken language technology for education, Speech Communication, vol.51, issue.10, pp.832-844, 2009.
DOI : 10.1016/j.specom.2009.04.005

J. Goldman, A. Auchlin, and A. C. Simon, Discrimination de styles de parole par analyse prosodique semi-automatique, 2009.

Z. Handley, Is text-to-speech synthesis ready for use in computer-assisted language learning?, Speech Communication, vol.51, issue.10, pp.906-919, 2009.
DOI : 10.1016/j.specom.2008.12.004
URL : https://hal.archives-ouvertes.fr/hal-00558516

A. R. Rebordao, S. M. Ferreira, A. Masum, K. Hirose, and N. Minematsu, How to Improve TTS Systems for Emotional Expressivity, Proceedings of Interspeech, pp.524-527, 2009.

M. Schröder, Expressive speech synthesis : Past, present, and possible futures, in Affective Information Processing, pp.111-126, 2009.

B. Vazirnezhad, F. Almasganj, and S. M. Ahadi, Hybrid statistical pronunciation models designed to be trained by a medium-size corpus, Computer Speech & Language, vol.23, issue.1, 2009.
DOI : 10.1016/j.csl.2008.02.001

S. J. Young, The HTK Book, version 3.4, 2009.

D. Cadic, C. Boidin, and C. d'Alessandro, Towards optimal TTS corpora, Proceedings of the Seventh International Conference on Language Resources and Evaluation, pp.99-104, 2010.

D. Cadic and C. d'Alessandro, High Quality TTS Voices Within One Day, Seventh ISCA Workshop on Speech Synthesis, 2010.

S. Calhoun, J. Carletta, J. M. Brenier, N. Mayo et al., The NXT-format Switchboard Corpus: a rich resource for investigating the syntax, semantics, pragmatics and prosody of dialogue, Language Resources and Evaluation, vol.44, pp.387-419, 2010.
DOI : 10.1075/pbns.16.12pri

A. Gelan, Language and Text-to-Speech Technologies for Highly Accessible Language & Culture Learning, International Journal of Emerging Technologies in Learning (iJET), vol.6, issue.2, 2010.
DOI : 10.3991/ijet.v6i2.1529

T. Lavergne, O. Cappé, and F. Yvon, Practical very large scale CRFs, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010.

F. Alías, L. Formiga, and X. Llorá, Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept, Speech Communication, vol.53, issue.5, 2011.
DOI : 10.1016/j.specom.2011.01.004

M. Avanzi, N. Obin, A. Lacheret, and B. Victorri, Toward a continuous modeling of French prosodic structure : Using acoustic features to predict prominence location and prominence degree, pp.2033-2036, 2011.
URL : https://hal.archives-ouvertes.fr/halshs-00636485

F. Hinterleitner, G. Neitzel, S. Möller, and C. Norrenbrock, An Evaluation Protocol for the Subjective Assessment of Text-to-Speech in Audiobook Reading Tasks, Proc. of Blizzard Challenge Workshop, 2011.

I. Illina, D. Fohr, and D. Jouvet, Grapheme-to-Phoneme Conversion using Conditional Random Fields, Proc. of Interspeech, pp.2313-2316, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00614981

N. Obin, MeLos : Analysis and modelling of speech prosody and speaking style, PhD thesis, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00694687

B. Post, The multi-faceted relation between phrasing and intonation contours in French, in Intonational Phrasing in Romance and Germanic : Cross-linguistic and bilingual studies, pp.44-74, 2011.

A. Stolcke, J. Zheng, W. Wang, and V. Abrash, SRILM at sixteen : Update and outlook, Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop, p.5, 2011.

D. Wang and S. King, Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields, IEEE Signal Processing Letters, vol.18, issue.2, pp.122-125, 2011.
DOI : 10.1109/LSP.2010.2098440
URL : http://www.cstr.inf.ed.ac.uk/downloads/publications/2011/wang_ieeesigprocletters2011.pdf

J. Duddington, eSpeak text to speech, 2012.

S. King and V. Karaiskos, The blizzard challenge 2012, Proc. Blizzard Challenge workshop, 2012.

C. Vaudable, Analyse et reconnaissance des émotions lors de conversations de centres d'appels, 2012.

S. Buchholz, J. Latorre, and K. Yanagisawa, Crowdsourced Assessment of Speech Synthesis, Crowdsourcing for Speech Processing, 2013.
DOI : 10.1109/TASL.2012.2187195

P. C. Dilts, Modelling phonetic reduction in a corpus of spoken English using random forests and mixed-effects regression, PhD thesis, 2013.

P. Karanasou, F. Yvon, T. Lavergne, and L. Lamel, Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR, 2013.

S. Brognaux, B. Picart, T. Drugman, and D. Louvain, Speech synthesis in various communicative situations : impact of pronunciation variations, pp.1524-1528, 2014.

B. Kolluru, V. Wan, J. Latorre, K. Yanagisawa et al., Generating multiple-accent pronunciations for TTS using joint sequence model interpolation, Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech), 2014.

J. Latorre, K. Yanagisawa, V. Wan, B. Kolluru et al., Speech intonation for TTS : Study on evaluation methodology, Proc. of Interspeech, 2014.

I. Sainz, E. Navas, I. Hernaez, A. Bonafonte, and F. Campillo, TTS evaluation campaign with a common Spanish database, Proc. of LREC, 2014.

D. Tihelka, J. Matoušek, and Z. Hanzlíček, Modelling F0 Dynamics in Unit Selection Based Speech Synthesis, Text, Speech and Dialogue, vol.1, pp.457-464, 2014.
DOI : 10.1007/978-3-319-10816-2_55

O. ar Mogn, Langue bretonne et nouvelles technologies : une vitalité à soutenir, Colloque sur les technologies pour les langues régionales de France, pp.71-76, 2015.

E. Delais-roussarie, B. Post, M. Avanzi, C. Buthke, A. Di Cristo et al., Intonational Phonology of French : Developing a ToBI system for French, in Intonation in Romance, pp.63-100, 2015.
URL : https://hal.archives-ouvertes.fr/halshs-01428391

K. Prahallad and A. Vadapalli, The Blizzard Challenge, Blizzard Challenge 2015 workshop, 2015.

K. Bartkova, D. Jouvet, and E. Delais-roussarie, Prosodic parameters and prosodic structures of French emotional data, Speech Prosody 2016, 2016.
DOI : 10.21437/SpeechProsody.2016-132
URL : https://hal.archives-ouvertes.fr/hal-01293516

E. Delais-roussarie, D. Lolive, H. Yoo, and D. Guennec, Rhythmic patterns and literary genres in synthesized speech, Speech Prosody 2016, 2016.
DOI : 10.21437/SpeechProsody.2016-12
URL : https://hal.archives-ouvertes.fr/hal-01338873

S. King and V. Karaiskos, The Blizzard Challenge 2016, Blizzard Challenge 2016 workshop, 2016.

K. Livescu, P. Jyothi, and E. Fosler-lussier, Articulatory feature-based pronunciation modeling, Computer Speech & Language, vol.36, pp.212-232, 2016.
DOI : 10.1016/j.csl.2015.07.003
URL : https://doi.org/10.1016/j.csl.2015.07.003

S. O. Arik, M. Chrzanowski, A. Coates, G. Diamos et al., Deep Voice : Real-time Neural Text-to-Speech, 2017.

J. Chevelu, G. Lecorvé, and D. Lolive, ROOTS : a toolkit for easy, fast and consistent processing of large sequential annotated data collections, Proceedings of the international conference on Language Resources and Evaluation (LREC), 2014.
URL : https://hal.archives-ouvertes.fr/hal-00974628

D. Guennec and D. Lolive, Unit Selection Cost Function Exploration Using an A* Based Text-to-Speech System, Proceedings of the international conference on Text, Speech and Dialogue (TSD) 2014, 2014.
DOI : 10.1007/978-3-319-10816-2_52
URL : https://hal.archives-ouvertes.fr/hal-01133321

M. Avanzi, G. Christodoulides, D. Lolive, E. Delais-roussarie, and N. Barbot, Towards the Adaptation of Prosodic Models for Expressive Text-To-Speech Synthesis, Proceedings of the International Conference on Speech Communication and Technology (Interspeech), 2014.
URL : https://hal.archives-ouvertes.fr/hal-01133316

R. Qader, G. Lecorvé, D. Lolive, and P. Sébillot, Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features, Proceedings of the International Conference on Statistical Language and Speech Processing, 2015.
DOI : 10.1109/LSP.2010.2098440
URL : https://hal.archives-ouvertes.fr/hal-01181192

G. Lecorvé and D. Lolive, Adaptive statistical utterance phonetization for French, Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2015.

J. Chevelu, D. Lolive, S. Le Maguer, and D. Guennec, How to Compare TTS Systems : A New Subjective Evaluation Methodology Focused on Differences, Proceedings of the International Conference on Speech Communication and Technology (Interspeech), 2015.
URL : https://hal.archives-ouvertes.fr/hal-01199082

R. Qader, G. Lecorvé, D. Lolive, and P. Sébillot, Ajout automatique de disfluences pour la synthèse de la parole spontanée : formalisation et preuve de concept, Actes de la conférence TALN 2017, 2017.

C. Fayet, A. Delhay, D. Lolive, and P. Marteau, Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus, Interspeech 2017, 2017.
DOI : 10.21437/Interspeech.2017-1194

In addition, since my recruitment as an associate professor (maître de conférences) [...]. A new apprenticeship-based programme has been open at Enssat since September 2009. It specialises in Computer Science, Multimedia and Networks (Informatique, Multimédia et Réseaux). I took part in setting up this programme, in particular its second year (designing the curriculum and putting the courses into operation). I was then responsible for the second year from the creation of the programme until 2014. Since September 2015, I have been responsible for the 2nd year of Enssat's Computer Science engineering programme. For both of these responsibilities, I have been in charge, since my recruitment, of organising the teaching over the year

[...] I am also an elected member of the Enssat school council. This council is notably in charge of the major directions taken by the school, 2012.

Finally, [...] of about ten days that the 2nd-year apprentice engineering students (about 25 students) must complete. This notably includes organising a course at a partner institution. For example, in 2015 and 2016, I organised a course given by John Kelleher at the Dublin Institute of Technology, Ireland. These interactions also led to the creation of an Erasmus agreement between our two institutions

[...] must be addressed. One difficulty is that these two problems are closely linked

Two main changes over the course of the project are worth noting. The first is the restriction to the FLM audience (French as a first language), owing to the significant differences between the FLE audience (French as a foreign language) and cycle 2 pupils. The second is the change in priorities regarding the definition of the learner profile and the generation of exercises. This led us to work first on defining the profile characteristics that serve as the basis for exercise generation

In general, the specification of the exercises was therefore carried out as a priority, in collaboration with education specialists (CREAD) and teachers working in cycle 2. This first phase led to the selection of three types of exercise to be developed in the rest of the project: phonological oppositions, word segmentation, and dictation. A first version of the distributed platform could be built in parallel. This required substantial work on specifying the services provided by each partner

[...] synthesis will be able to make the errors perceptible. Moreover, limiting ourselves to a single validation of the pupil's answer brings nothing from a pedagogical point of view, which led us to devise a trial-and-error mechanism that lets pupils confront their difficulties. More specifically, the dictation case required in-depth studies to create a suitable voice and prosodic models that reflect a teacher's delivery while keeping pace with the pupil's typing speed. Several problems then arise: How should a dictation voice be built? What instructions should be applied during recording? Which recording script should be used?

[...] royalty-free books [...] built. To this end, the data structures existing at IRISA had to be adapted. The platform was evaluated in several stages over the course of the project

Ergonomics of the platform, validation of the exercise content by teachers, initial validation of the principle of using synthesis, and feedback on the benefit of the approach