, Speecon manually pitch-marked reference database for Spanish, pp.866-498

K. Bartkova, M. Dargnat, D. Jouvet, and L. Lee, Annotation of discourse particles in French over a large variety of speech corpora, ACor4French -Les corpus annotés du français, TALN'2017 -Traitement Automatique des Langues Naturelles, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01585540

K. Bartkova and D. Jouvet, Automatic Detection of the Prosodic Structures of Speech Utterances, SPECOM -15th International Conference on Speech and Computer -2013, vol.8113, pp.1-8, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00834318

K. Bartkova and D. Jouvet, Links between Manual Punctuation Marks and Automatically Detected Prosodic Structures, Speech Prosody, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00998031

K. Bartkova and D. Jouvet, Analysis of prosodic correlates of emotional speech data, ExLing 2018 -9th Tutorial and Research Workshop on Experimental Linguistics, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01889932

K. Bartkova, D. Jouvet, and E. Delais-roussarie, Prosodic Parameters and Prosodic Structures of French Emotional Data, Speech Prosody 2016. Speech Prosody, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01293516

M. Benzeghiba, R. De-mori, O. Deroo, S. Dupont, T. Erbes et al., Automatic speech recognition and speech variability: A review, Speech Communication, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00616506

M. Bisani and H. Ney, Joint-sequence models for grapheme-to-phoneme conversion, Speech communication, vol.50, issue.5, pp.434-451, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00499203

P. Boersma, Accurate short-term analysis of the fundamental frequency and the harmonicsto-noise ratio of a sampled sound, Proc. of the Institute of Phonetic Sciences, vol.17, pp.97-110, 1993.

P. Boersma and D. Weenink, Praat: doing phonetics by computer, 2011.

A. Bonneau, D. Fohr, I. Illina, D. Jouvet, O. Mella et al., Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde, Traitement Automatique des Langues, vol.53, issue.3, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00834278

A. Camacho and J. G. Harris, A sawtooth waveform inspired pitch estimator for speech and music, The Journal of the Acoustical Society of America, vol.124, issue.3, pp.1638-1652, 2008.

A. De-cheveigné and H. Kawahara, YIN, a fundamental frequency estimator for speech and music, Journal of the Acoustical Society of America, vol.111, issue.4, pp.1917-1930, 2002.

M. Dargnat, K. Bartkova, and D. Jouvet, Discourse Particles In French: Prosodic Parameters Extraction and Analysis, International Conference on Statistical Language and Speech Processing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184197

B. Deng, D. Jouvet, Y. Laprie, I. Steiner, and A. Sini, Towards Confidence Measures on Fundamental Frequency Estimations, IEEE International Conference on Acoustics, Speech and Signal Processing, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01493168

M. Eskenazi, An overview of spoken language technology for education, Speech Communication, vol.51, issue.10, pp.832-844, 2009.

D. Fohr, O. Mella, and D. Jouvet, De l'importance de l'homogénéisation des conventions de transcription pour l'alignement automatique de corpus oraux de parole spontanée, 8es Journées Internationales de Linguistique de Corpus (JLC2015), 2015.

P. Ghahremani, B. Babaali, D. Povey, K. Riedhammer, J. Trmal et al., A pitch extraction algorithm tuned for automatic speech recognition, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp.2494-2498, 2014.

I. Illina, D. Fohr, and D. Jouvet, Multiple Pronunciation Generation using Grapheme-toPhoneme Conversion based on Conditional Random Fields, XIV International Conference "Speech and Computer" (SPECOM'2011), 2011.

I. Illina, D. Fohr, and D. Jouvet, Génération des prononciations de noms propresà l'aide des champs aéatoires conditionnels, JEP-TALN-RECITAL 2012, 2012.

D. Jouvet and K. Bartkova, Acoustical Frame Rate and Pronunciation Variant Statistics, International Conference on Statistical Language and Speech Processing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184195

D. Jouvet, K. Bartkova, M. Dargnat, and L. Lee, Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora, SLSP'2017, 5th International Conference on Statistical Language and Speech Processing, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01585567

D. Jouvet and Y. Laprie, Performance Analysis of Several Pitch Detection Algorithms on Simulated and Real Noisy Speech Data, EUSIPCO'2017, 25th European Signal Processing Conference, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01585554

D. Jouvet, L. Mesbahi, A. Bonneau, D. Fohr, I. Illina et al., Impact of Pronunciation Variant Frequency on Automatic Non-Native Speech Segmentation, 5th Language & Technology Conference -LTC'11, pp.145-148, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00639118

D. Jurafsky, R. Bates, N. Coccaro, R. Martin, M. Meteer et al., Automatic detection of discourse structure for speech recognition and understanding, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, pp.88-95, 1997.

H. Kawahara, A. De-cheveigné, H. Banno, T. Takahashi, and T. Irino, Nearly defect-free f0 trajectory extraction for expressive speech modifications based on straight, Interspeech. pp, pp.537-540, 2005.

H. Kawahara, J. Estill, and O. Fujimura, Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system straight, MAVEBA. pp, pp.59-64, 2001.

H. Kawahara, H. Katayose, A. De-cheveigné, and R. D. Patterson, Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of f0 and periodicity, Eurospeech. pp, pp.2781-2784, 1999.
URL : https://hal.archives-ouvertes.fr/hal-01105607

J. Kolá? and L. Lamel, Development and evaluation of automatic punctuation for french and english speech-to-text, Thirteenth Annual Conference of the International Speech Communication Association, 2012.

P. Král, J. Kleckova, and C. Cerisara, Sentence modality recognition in french based on prosody, International Conference on Enformatika, vol.8, pp.185-188, 2005.

A. Kulkarni, C. Vincent, and J. Denis, Layer adaptation for transfer of expressivity in speech synthesis, Proceedings of LTC'2019, 9th Language and Technology Conference, 2019.

R. B. Lanjewar and D. Chaudhari, Speech emotion recognition: a review, International Journal of Innovative Technology and Exploring Engineering (IJITEE), vol.2, pp.68-71, 2013.

L. Lee, K. Bartkova, M. Dargnat, and D. Jouvet, Prosodic and Pragmatic Values of Discourse Particles in French, ExLing 2018 -9th Tutorial and Research Workshop on Experimental Linguistics, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01889925

L. Lee, K. Bartkova, D. Jouvet, M. Dargnat, and K. Yvon, Can prosody meet pragmatics? Case of discourse particles in French, to appear in Proceedings of ICPhS'2019, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02177202

A. Margolis and M. Ostendorf, Question detection in spoken conversations using textual conversations, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol.2, pp.118-124, 2011.

P. Martin, Prosodic and rhythmic structures in french, Linguistics, vol.25, issue.5, pp.925-950, 1987.
URL : https://hal.archives-ouvertes.fr/halshs-01450547

L. Mesbahi, D. Jouvet, A. Bonneau, D. Fohr, I. Illina et al., Reliability of nonnative speech automatic segmentation for prosodic feedback, Workshop on Speech and Language Technology in Education -SLaTE 2011. ISCA, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00614930

J. R. Novak, N. Minematsu, and K. Hirose, Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the wfst framework, Natural Language Engineering, vol.22, issue.6, pp.907-938, 2016.

L. Orosanu and D. Jouvet, Combining lexical and prosodic features for automatic detection of sentence modality in French, International Conference on Statistical Language and Speech Processing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184196

L. Orosanu, D. Jouvet, D. Fohr, I. Illina, and A. Bonneau, Combining criteria for the detection of incorrect entries of non-native speech in the context of foreign language learning, SLT 2012 -4th IEEE Workshop on Spoken Language Technology. Miami, United States, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00753458

G. Pirker, M. Wohlmayr, S. Petrik, and F. Pernkopf, A pitch tracking corpus with evaluation on multipitch tracking scenario, pp.1509-1512, 2011.

V. M. Quang, E. Castelli, and P. N. Yên, A decision tree-based method for speech processing: question sentence detection, International Conference on Fuzzy Systems and Knowledge Discovery, pp.1205-1212, 2006.

K. Rao, F. Peng, H. Sak, and F. Beaufays, Grapheme-to-phoneme conversion using long shortterm memory recurrent neural networks, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4225-4229, 2015.

M. Schröder, Expressive speech synthesis: Past, present, and possible futures, Affective information processing, pp.111-126, 2009.

B. W. Schuller, Speech emotion recognition: Two decades in a nutshell, benchmarks, and ongoing trends, Communications of the ACM, vol.61, issue.5, pp.90-99, 2018.

N. Segal and K. Bartkova, Prosodic structure representation for boundary detection in spontaneous french, Proceedings of ICPhS, pp.1197-1200, 2007.

V. Sethu, J. Epps, and E. Ambikairajah, Speech based emotion recognition, Speech and Audio Processing for Coding, Enhancement and Recognition, pp.197-228, 2015.

R. Shadiev, W. Y. Hwang, and Y. M. Huang, Review of research on mobile language learning in authentic environments, Computer Assisted Language Learning, vol.30, issue.3-4, pp.284-303, 2017.

A. Sorin, T. Ramabadran, D. Chazan, R. Hoory, M. Mclaughlin et al., The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol.I, pp.129-132, 2004.

M. Stede and B. Schmitz, Discourse particles and discourse functions, Machine translation, vol.15, pp.125-147, 2000.

S. Strömbergsson, Today's most frequently used f 0 estimation methods, and their accuracy in estimating male and female pitch in clean speech, pp.525-529, 2016.

D. Talkin, A robust algorithm for pitch tracking (RAPT), Speech Coding and Synthesis, pp.495-518, 1995.

O. Viberg and Å. Grönlund, Mobile assisted language learning: A literature review, 11th World Conference on Mobile and Contextual Learning, 2012.

S. M. Witt and S. J. Young, Phone-level pronunciation scoring and assessment for interactive language learning, Speech communication, vol.30, issue.2-3, pp.95-108, 2000.

Z. Wu, O. Watts, and S. King, Merlin: An open source neural network speech synthesis system, pp.202-207, 2016.

K. Yao and G. Zweig, Sequence-to-sequence neural net models for grapheme-to-phoneme conversion, 2015.

H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko et al., The hmmbased speech synthesis system (hts) version 2.0, pp.294-299, 2007.