An Effective Lip Tracking Algorithm for Acoustic-to-Articulatory Inversion

Jingying Chen; Marie-Odile Berger; Yves Laprie

Conference paper, Year: 2004


Jingying Chen

Role: Author

Analysis, perception and recognition of speech

Marie-Odile Berger

Role: Author
PersonId : 830601

Models, algorithms and geometry for computer graphics and vision

Yves Laprie

Role: Author
PersonId : 6696
IdHAL : yves-laprie
ORCID : 0000-0002-2379-6481
IdRef : 060274387

Analysis, perception and recognition of speech

Abstract

Although automatic speech recognition systems now perform well under certain conditions, they still do not give good results in real-life conditions, especially in noisy environments. Several authors have suggested that using articulatory features rather than acoustic features as the basis for speech parameterization would yield better recognition results. The articulatory features can be recovered from the speech signal by acoustic-to-articulatory inversion. Recovering the articulatory state from the acoustic signal alone is considered difficult because of the "one-to-many" nature of the acoustic-to-articulatory inversion problem: a given articulatory state always has exactly one acoustic realization, but an acoustic signal can be the outcome of more than one articulatory state. Since visual information is complementary to acoustic information in the inversion, this paper proposes lip tracking to provide visual information about lip movement for acoustic-to-articulatory inversion. Encouraging results demonstrate the effectiveness of this method, which provides useful information (i.e. mouth width and height) for inversion.
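The role that mouth width and height could play in resolving the one-to-many ambiguity can be illustrated with a minimal sketch. It is not the tracking algorithm of the paper: it assumes that a lip contour is already available as a list of (x, y) points, measures width and height as the extent of that contour, and then discards candidate articulatory states whose lip dimensions disagree with the measurement. The function names, the `lip_width` and `lip_height` keys, and the tolerance are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float]


def mouth_width_height(contour: Sequence[Point]) -> Tuple[float, float]:
    """Return (width, height) as the horizontal and vertical extent of the contour.

    Assumption: the contour points come from some lip tracker and share one unit.
    """
    if not contour:
        raise ValueError("contour must contain at least one point")
    xs = [x for x, _ in contour]
    ys = [y for _, y in contour]
    return max(xs) - min(xs), max(ys) - min(ys)


def keep_visually_consistent(
    candidates: List[Dict[str, float]],
    width: float,
    height: float,
    tolerance: float,
) -> List[Dict[str, float]]:
    """Keep the candidate states whose lip dimensions match the visual measurement.

    Each candidate is a hypothetical articulatory state that explains the same
    acoustic signal; the visual measurement is used only to prune the set.
    """
    return [
        c
        for c in candidates
        if abs(c["lip_width"] - width) <= tolerance
        and abs(c["lip_height"] - height) <= tolerance
    ]


# Four illustrative contour points: left corner, top, right corner, bottom.
contour = [(10.0, 20.0), (30.0, 14.0), (50.0, 20.0), (30.0, 28.0)]
width, height = mouth_width_height(contour)

# Two made-up candidate states assumed to produce the same acoustics.
candidates = [
    {"lip_width": 40.0, "lip_height": 13.0},
    {"lip_width": 25.0, "lip_height": 14.0},
]
print(keep_visually_consistent(candidates, width, height, tolerance=2.0))
```

In this toy setting the acoustic signal alone cannot separate the two candidates, whereas the measured lip dimensions rule one of them out, which is the sense in which visual information complements acoustic information.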

Keywords

lip tracking; speech processing; speech; image; image processing

Domains

Other [cs.OH]

Main file

A04-R-336.pdf (150.72 KB)

Contributor: Publications Loria

https://inria.hal.science/inria-00099905

Submitted on: Tuesday, 26 September 2006, 10:06:01

Last modified on: Thursday, 15 February 2024, 03:31:05

Long-term archiving on: Wednesday, 29 March 2017, 12:44:17

Dates and versions

inria-00099905 , version 1 (26-09-2006)

Identifiers

HAL Id : inria-00099905 , version 1

Cite

Jingying Chen, Marie-Odile Berger, Yves Laprie. An Effective Lip Tracking Algorithm for Acoustic-to-Articulatory Inversion. 5th International Workshop on Image Analysis for Multimedia - WIAMIS'2004, Apr 2004, Lisbon, Portugal, 3 p. ⟨inria-00099905⟩


Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

118 views

67 downloads
