A Robust Lip Tracking System for the Acoustic to Articulatory Inversion

Jingying Chen 1 Yves Laprie 1 Marie-Odile Berger 2
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
2 ISA - Models, algorithms and geometry for computer graphics and vision
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : The acoustic to articulatory inversion of speech which refers to the mapping from the acoustic signal to the articulatory, is an interesting problem. Given the acoustic signal, the recovery of the articulatory state is considered difficult. The reason is the "one-to-many" nature of the acoustic-to-articulatory inversion problem: a given articulatory state has always only one acoustic realization but an acoustic signal can be the outcome of more than one articulatory states. In order to solve the one-to-many problem of the inversion, visual information complementary to acoustic signal is used. Hence, a robust lip tracking system to provide visual information (such as the width and height of mouth) for the acoustic-to-articulatory inversion is developed in this paper. The proposed approach uses a combination of motion, color and structure information of the mouth area to track lip feature points. This technique is designed to be effective and robust. It has the advantages to detect the lip feature points automatically and recover the feature points lost during tracking process. Encouraging results have been obtained using the proposed approach.
Type de document :
Communication dans un congrès
6th IASTED International Conference on Signal and Image Processing - SIP'2004, 2004, Honolulu, Hawaii, USA, 6 p, 2004
Liste complète des métadonnées

https://hal.inria.fr/inria-00099907
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 10:06:15
Dernière modification le : jeudi 11 janvier 2018 - 06:19:57
Document(s) archivé(s) le : mercredi 29 mars 2017 - 13:24:23

Fichiers

Identifiants

  • HAL Id : inria-00099907, version 1

Collections

Citation

Jingying Chen, Yves Laprie, Marie-Odile Berger. A Robust Lip Tracking System for the Acoustic to Articulatory Inversion. 6th IASTED International Conference on Signal and Image Processing - SIP'2004, 2004, Honolulu, Hawaii, USA, 6 p, 2004. 〈inria-00099907〉

Partager

Métriques

Consultations de la notice

288

Téléchargements de fichiers

34