A Robust Lip Tracking System for the Acoustic to Articulatory Inversion

Jingying Chen 1 Yves Laprie 1 Marie-Odile Berger 2
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
2 ISA - Models, algorithms and geometry for computer graphics and vision
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The acoustic to articulatory inversion of speech which refers to the mapping from the acoustic signal to the articulatory, is an interesting problem. Given the acoustic signal, the recovery of the articulatory state is considered difficult. The reason is the "one-to-many" nature of the acoustic-to-articulatory inversion problem: a given articulatory state has always only one acoustic realization but an acoustic signal can be the outcome of more than one articulatory states. In order to solve the one-to-many problem of the inversion, visual information complementary to acoustic signal is used. Hence, a robust lip tracking system to provide visual information (such as the width and height of mouth) for the acoustic-to-articulatory inversion is developed in this paper. The proposed approach uses a combination of motion, color and structure information of the mouth area to track lip feature points. This technique is designed to be effective and robust. It has the advantages to detect the lip feature points automatically and recover the feature points lost during tracking process. Encouraging results have been obtained using the proposed approach.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/inria-00099907
Contributor : Publications Loria <>
Submitted on : Tuesday, September 26, 2006 - 10:06:15 AM
Last modification on : Thursday, January 11, 2018 - 6:19:57 AM
Long-term archiving on : Wednesday, March 29, 2017 - 1:24:23 PM

Identifiers

  • HAL Id : inria-00099907, version 1

Collections

Citation

Jingying Chen, Yves Laprie, Marie-Odile Berger. A Robust Lip Tracking System for the Acoustic to Articulatory Inversion. 6th IASTED International Conference on Signal and Image Processing - SIP'2004, 2004, Honolulu, Hawaii, USA, 6 p. ⟨inria-00099907⟩

Share

Metrics

Record views

326

Files downloads

53