J. J. Godfrey, E. C. Holliman, and J. McDaniel, SWITCHBOARD: telephone speech corpus for research and development, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.517-520, 1992.
DOI : 10.1109/ICASSP.1992.225858

S. Galliano, E. Geoffrois, G. Gravier, J. F. Bonastre, D. Mostefa, and K. Choukri, Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News, Language Resources and Evaluation Conference, 2006.

M. Avanzi, A. C. Simon, J. P. Goldman, and A. Auchlin, C-PROM: An annotated corpus for French prominence studies, Proceedings of Prosodic Prominence: Perceptual and Automatic Identification, Speech Prosody, 2010.

H. McGurk and J. MacDonald, Hearing lips and seeing voices, Nature, vol.264, issue.5588, pp.746-748, 1976.
DOI : 10.1038/264746a0

E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, CUAVE: A new audio-visual database for multimodal human-computer interface research, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.2017-2020, 2002.

B. Lee, M. Hasegawa-Johnson, C. Goudeseune, S. Kamdar, S. Borys et al., AVICAR: Audio-Visual Speech Corpus in a Car Environment, 2004.

A. Wrench and W. Hardcastle, A multichannel articulatory speech database and its application for automatic speech recognition, Proceedings of the 5th Seminar on Speech Production, pp.305-308, 2000.

T. Frank, M. Hoch, and G. Trogemann, Automated Lip-Sync for 3D-Character Animation, 15th IMACS World Congress on Scientific Computation, Modelling and Applied Mathematics, pp.24-29, 1997.

S. Nakamura, Statistical multimodal integration for audio-visual speech processing, IEEE Transactions on Neural Networks, vol.13, issue.4, pp.854-866, 2002.
DOI : 10.1109/TNN.2002.1021886

L. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

J. Park and H. Ko, Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync, IEEE Transactions on Multimedia, vol.10, issue.7, pp.1299-1306, 2008.

S. Foo and L. Dong, Recognition of Visual Speech Elements Using Hidden Markov Models, Advances in Multimedia Information Processing, pp.153-173, 2002.
DOI : 10.1007/3-540-36228-2_75

G. Gibert, G. Bailly, D. Beautemps, F. Elisei, and R. Brun, Analysis and synthesis of the three-dimensional movements of the head, face, and hand of a speaker using cued speech, The Journal of the Acoustical Society of America, vol.118, issue.2, pp.1144-1153, 2005.
DOI : 10.1121/1.1944587

G. Gibert, Conception et évaluation d'un système de synthèse 3D de Langue française Parlée Complétée (LPC) à partir du texte, 2006.