, Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence, Lecture Notes in Computer Science, vol.4629, pp.459-476, 2007.

[. Bahdanau, Neural machine translation by jointly learning to align and translate, 2014.

[. Betz, Micro-structure of disfluencies : Basics for conversational speech synthesis, 2015.

K. ;. Blankenship, J. Blankenship, and C. Kay, Hesitation phenomena in spontaneous english speech : A study in distribution, vol.20, pp.360-372, 1964.

J. Bruner and A. Deshpande, , 2017.

[. Cho, Learning phrase representations using RNN encoder-decoder for statistical machine translation, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

[. Christenfeld, Filled pauses and gestures : it's not coincidence, Journal of Psycholinguistic Research, p.20, 1991.

C. , Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014.

. Dall, Investigating automatic and human filled pause insertion for speech synthesis, 2014.

[. Goodfellow, Generative adversarial nets, Advances in neural information processing systems, pp.2672-2680, 2014.

[. Greff, LSTM : A search space odyssey, IEEE Transactions on Neural Networks and Learning Systems, vol.28, pp.2222-2232, 2017.

[. Hassan, Segmentation and disfluency removal for conversational speech translation, 2014.

[. Hermann, Teaching machines to read and comprehend, Advances in Neural Information Processing Systems 28, 2015.

M. Le, Q. Le, and T. Mikolov, Distributed representations of sentences and documents, Proceedings of the International Conference on Machine Learning, 2014.

O. ;. Maclay, H. Maclay, and C. E. Osgood, Hesitation phenomena in spontaneous english speech, vol.15, pp.19-44, 1959.

[. Mikolov, Linguistic regularities in continuous space word representations, Proceedings of the North American Chapter of the Association for Computionnal Linguistics : Human Language Technologies, pp.746-751, 2013.

[. Pascanu, On the difficulty of training recurrent neural networks, Proceedings of the 30-th International Conference on Machine Learning, p.28, 2013.

R. Qader-;-qader, Pronunciation and disfluency modelling for spontaneous speech synthesis, 2017.

[. Qader, Ajout automatique de disfluences pour la synthèse de la parole spontanée : formalisation et preuve de concept, Proceedings of TALN, 2014.

[. Rao, Grapheme-to-phoneme conversion using long-short-term memory recurrent neural networks, IEEE International Conference on Acoustics, Speech and Signal Processing, 2015.

R. L. Rose-;-rose, The communicative value of filled pauses in spontaneous speech, 1998.

E. E. Shriberg, Preliminaries to a theory of speech disfluencies, 1994.

[. Sutskever, Sequence to sequence learning with neural networks, Advances in Neural Information Processing Systems, vol.27, pp.3104-3112, 2014.

J. E. Tree-;-tree, The effects of false starts and repetitions on the processing of subsequent words in spontaneous speech, Journal of Memory and Language, vol.34, pp.709-738, 1995.

J. E. Tree, Listeners' uses of um and uh in speech comprehension, Memory and cognition, vol.29, pp.320-326, 2001.

[. Young, Recent trends in deep learning based natural language processing, 2017.