S. Abd-el-malek, Observations on the morphology of the human tongue, Journal of Anatomy, vol.73, pp.201-210, 1939.

P. Badin, F. Elisei, G. Bailly, and Y. Tarabalka, An Audiovisual Talking Head for Augmented Speech Generation: Models and Animations Based on a Real Speaker's Articulatory Data, Lecture Notes in Computer Science, vol.5098, pp.132-143, 2008.
DOI : 10.1007/978-3-540-70517-8_14

P. Badin, Y. Tarabalka, F. Elisei, and G. Bailly, Can you "read" tongue movements? Evaluation of the contribution of tongue display to speech understanding, Speech Communication, vol.52, issue.6, pp.493-503, 2010.
DOI : 10.1016/j.specom.2010.03.002

URL : https://hal.archives-ouvertes.fr/hal-00175680

J. Beskow, Trainable Articulatory Control Models for Visual Speech Synthesis, International Journal of Speech Technology, vol.7, issue.4, pp.335-349, 2004.
DOI : 10.1023/B:IJST.0000037076.86366.8d

C. T. Best, G. W. McRoberts, and E. Goodell, Discrimination of non-native consonant contrasts varying in perceptual assimilation to the listener's native phonological system, The Journal of the Acoustical Society of America, vol.109, issue.2, pp.775-794, 2001.
DOI : 10.1121/1.1332378

C. T. Bole and M. A. Lessler, Electromyography of the genioglossus muscles in man, Journal of Applied Physiology, vol.21, issue.6, pp.1695-1698, 1966.

A. Bosseler and D. W. Massaro, Development and Evaluation of a Computer-Animated Tutor for Vocabulary and Language Learning in Children with Autism, Journal of Autism and Developmental Disorders, vol.33, issue.6, pp.653-672, 2003.
DOI : 10.1023/B:JADD.0000006002.82367.4f

P. Carpentier and D. Pajoni, La langue: un ensemble musculaire complexe, Revue d'Orthopédie Dento-Faciale, pp.19-28, 1989.

M. Cohen and D. W. Massaro, Modelling coarticulation in synthetic visual speech, Models and Techniques in Computer Animation, pp.139-156, 1993.

M. M. Cohen, D. W. Massaro, and R. Clark, Training a talking head, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces, pp.499-505, 2002.
DOI : 10.1109/ICMI.2002.1167046

V. Colotte, Y. Laprie, and A. Bonneau, Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition, Proceedings of Eurospeech, pp.469-473, 2001.
URL : https://hal.archives-ouvertes.fr/inria-00100476

P. Cosi, E. Caldognetto, G. Perin, and C. Zmarich, Labial coarticulation modeling for realistic facial animation, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces, pp.505-510, 2002.
DOI : 10.1109/ICMI.2002.1167047

R. Crawford, Teaching voiced velar stops to profoundly deaf children, using EPG – two case studies, Clinical Linguistics & Phonetics, vol.19, issue.3, pp.255-269, 1995.
DOI : 10.1044/jshr.1804.795

C. Cucchiarini, H. Strik, and L. Boves, Quantitative assessment of second language learners' fluency by means of automatic speech recognition technology, The Journal of the Acoustical Society of America, vol.107, issue.2, pp.989-999, 2000.
DOI : 10.1121/1.428279

H. Dent, F. Gibbon, and W. Hardcastle, The application of electropalatography (EPG) to the remediation of speech disorders in school-aged children and young adults, International Journal of Language & Communication Disorders, vol.30, issue.2, pp.264-277, 1995.
DOI : 10.3109/13682829509082537

O. Engwall, Combining MRI, EMA and EPG measurements in a three-dimensional tongue model, Speech Communication, vol.41, issue.2-3, pp.303-329, 2003.
DOI : 10.1016/S0167-6393(02)00132-2

O. Engwall, Analysis of and feedback on phonetic features in pronunciation training with a virtual teacher, Computer Assisted Language Learning, vol.6, issue.1, pp.37-64, 2012.
DOI : 10.1016/S0167-6393(98)00048-X

M. Eskenazi, An overview of spoken language technology for education, Speech Communication, vol.51, issue.10, pp.832-844, 2009.
DOI : 10.1016/j.specom.2009.04.005

L. Fadiga, L. Craighero, G. Buccino, and G. Rizzolatti, Speech listening specifically modulates the excitability of tongue muscles: a TMS study, European Journal of Neuroscience, vol.96, issue.2, pp.399-402, 2002.
DOI : 10.1126/science.286.5449.2526

J. E. Flege, O. Bohn, and S. Jang, Effects of experience on non-native speakers' production and perception of English vowels, Journal of Phonetics, vol.25, issue.4, pp.437-470, 1997.
DOI : 10.1006/jpho.1997.0052

J. E. Flege, M. J. Munro, and I. R. MacKay, Factors affecting strength of perceived foreign accent in a second language, The Journal of the Acoustical Society of America, vol.97, issue.5, pp.3125-3134, 1995.
DOI : 10.1121/1.413041

K. Grauwinkel, B. Dewitt, and S. Fagel, Visualization of internal articulator dynamics and its intelligibility in synthetic audiovisual speech, Proceedings of 16th International Congress of Phonetic Sciences (ICPhS), pp.2173-2176, 2007.

K. Grauwinkel and S. Fagel, Visualization of internal articulator dynamics for use in speech therapy for children with Sigmatismus Interdentalis, Proceedings of International Conference on Auditory-Visual Speech Processing, pp.142-145, 2007.

W. Hardcastle and F. Gibbon, Electropalatography and its Clinical Applications, Instrumental Clinical Phonetics, pp.149-193, 1997.
DOI : 10.1002/9780470699119.ch6

V. Hazan, A. Sennema, A. Faulkner, M. Ortega-Llebaria, M. Iba et al., The use of visual cues in the perception of non-native consonant contrasts, The Journal of the Acoustical Society of America, vol.119, issue.3, pp.1740-1751, 2006.
DOI : 10.1121/1.2166611

V. Hazan and A. Simpson, The Effect of Cue-Enhancement on Consonant Intelligibility in Noise: Speaker and Listener Effects, Language and Speech, vol.43, issue.3, pp.273-294, 2000.
DOI : 10.1177/00238309000430030301

P. Iverson, P. K. Kuhl, R. Akahane-Yamada, E. Diesch, Y. Tohkura et al., A perceptual interference account of acquisition difficulties for non-native phonemes, Cognition, vol.87, issue.1, pp.47-57, 2003.
DOI : 10.1016/S0010-0277(02)00198-1

W. Katz, D. Garst, G. Carter, M. McNeil, T. Fossett et al., Treatment of an individual with aphasia and apraxia of speech using EMA visually-augmented feedback, Brain and Language, vol.103, issue.1-2, pp.213-214, 2007.
DOI : 10.1016/j.bandl.2007.07.121

B. J. Kröger, V. Graf-Bortscheller, and A. Lowit, Two- and three-dimensional visual articulatory models for pronunciation training and for treatment of speech disorders, Proceedings of Interspeech, pp.2639-2642, 2008.

P. Kuhl, Human adults and human infants show a "perceptual magnet effect" for the prototypes of speech categories, monkeys do not, Attention, Perception, and Psychophysics, vol.50, pp.93-107, 1991.

S. Lambacher, A CALL Tool for Improving Second Language Acquisition of English Consonants by Japanese Learners, Computer Assisted Language Learning, vol.12, issue.2, pp.137-156, 1999.
DOI : 10.1076/call.12.2.137.5722

M. Li, C. Kambhamettu, and M. Stone, Automatic contour tracking in ultrasound images, Clinical Linguistics & Phonetics, vol.10, issue.6-7, pp.545-554, 2005.
DOI : 10.1121/1.402934

D. Massaro, S. Bigler, T. Chen, M. Perlman, and S. Ouni, Pronunciation Training: The Role of Eye and Ear, Proceedings of Interspeech, pp.2623-2626, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00327687

D. W. Massaro and J. Light, Read my tongue movements: Bimodal learning to perceive and produce non-native speech, Proceedings of Interspeech, pp.2249-2252, 2003.

D. W. Massaro, A computer-animated tutor for spoken and written language learning, Proceedings of the 5th international conference on Multimodal interfaces, ICMI '03, 2003.
DOI : 10.1145/958432.958466

D. W. Massaro, Y. Liu, T. H. Chen, and C. A. Perfetti, A Multilingual Embodied Conversational Agent for Tutoring Speech and Language Learning, Proceedings Interspeech, pp.825-828, 2006.

A. Neri, O. Mich, M. Gerosa, and D. Giuliani, The effectiveness of computer assisted pronunciation training for foreign language learning by children, Computer Assisted Language Learning, vol.16, issue.5, pp.393-408, 2008.
DOI : 10.1037/0096-1523.20.2.421

V. Pantelemidou, R. Herman, and J. Thomas, Efficacy of speech intervention using electropalatography with a cochlear implant user, Clinical Linguistics & Phonetics, vol.17, issue.4-5, pp.4-5, 2003.
DOI : 10.1080/0269920031000079958

D. Pisoni, S. Lively, and J. Logan, Perceptual learning of nonnative speech contrasts: Implications for theories of speech perception, in Development of speech perception: The transition from recognizing speech sounds to spoken words, pp.121-166, 1994.

K. Probst, Y. Ke, and M. Eskenazi, Enhancing foreign language tutors – In search of the golden speaker, Speech Communication, vol.37, issue.3-4, pp.3-4, 2002.
DOI : 10.1016/S0167-6393(01)00009-7

H. Strik, A. Neri, and C. Cucchiarini, Speech technology for language tutoring, LangTech, 2008.

L. Wang, Y. Qian, M. Scott, G. Chen, and F. Soong, Computer-Assisted Audiovisual Language Learning, Computer, vol.45, issue.6, pp.45-83, 2012.
DOI : 10.1109/MC.2012.152

K. E. Watkins, A. P. Strafella, and T. Paus, Seeing and hearing speech excites the motor system involved in speech production, Neuropsychologia, vol.41, issue.8, pp.989-994, 2003.
DOI : 10.1016/S0028-3932(02)00316-0

P. Wik, Embodied conversational agents in computer assisted language learning, Speech Communication, vol.51, issue.10, pp.1024-1037, 2009.
DOI : 10.1016/j.specom.2009.05.006

URL : https://hal.archives-ouvertes.fr/hal-00558521