Adapting visual data to a linear articulatory model

Yves Laprie; Blaise Potard

Communication Dans Un Congrès Année : 2006

Adapting visual data to a linear articulatory model

(1) , (1)

Yves Laprie

Fonction : Auteur
PersonId : 6696
IdHAL : yves-laprie
ORCID : 0000-0002-2379-6481
IdRef : 060274387

Analysis, perception and recognition of speech

Blaise Potard

Fonction : Auteur

Analysis, perception and recognition of speech

Résumé

The goal of this work is to investigate audiovisual-to-articulatory inversion. It is well established that acoustic-to-articulatory inversion is an underdetermined problem. On the other hand, there is strong evidence that human speakers/listeners exploit the multimodality of speech, and more particularly the articulatory cues: the view of visible articulators, i.e. jaw and lips, improves speech intelligibility. It is thus interesting to add constraints provided by the direct visual observation of the speaker's face. Visible data was obtained by stereo-vision and enable the 3D recovery of jaw and lip movements. These data were processed to fit the nature of parameters of Maeda's articulatory model. Inversion experiments were conducted.

Mots clés

audiovisual inversion

Domaines

Informatique et langage [cs.CL]

Fichier principal

audiovisualinv.pdf (198.35 Ko)

Blaise Potard : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00112223

Soumis le : mardi 7 novembre 2006-19:16:56

Dernière modification le : vendredi 24 mars 2023-14:52:48

Archivage à long terme le : mardi 6 avril 2010-21:50:35

Dates et versions

inria-00112223 , version 1 (07-11-2006)

Identifiants

HAL Id : inria-00112223 , version 1

Citer

Yves Laprie, Blaise Potard. Adapting visual data to a linear articulatory model. 7th International Seminar on Speech Production - ISSP 2006, Dec 2006, Sao Paulo/Brazil. ⟨inria-00112223⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

160 Consultations

100 Téléchargements

Adapting visual data to a linear articulatory model

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager