Skip to Main content Skip to Navigation
Conference papers

From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction

Abstract : This paper presents a method for the extraction of articulatory parameters from direct processing of raw images of the lips. The system architecture is made of three independent parts. First, a new greyscale mouth image is centred and downsampled. Second, the image is aligned and projected onto a basis of artificial images. These images are the eigenvectors computed from a PCA applied on a set of 23 reference lip shapes. Then, a multilinear interpolation predicts articulatory parameters from the image projection coefficients onto the eigenvectors. In addition, the projection coefficients and the predicted parameters were evaluated by an HMMbased visual speech recogniser. Recognition scores obtained with our method are compared to reference scores and discussed.
Document type :
Conference papers
Complete list of metadata

Cited literature [5 references]  Display  Hide  Download

https://hal.inria.fr/inria-00389372
Contributor : Lionel Reveret <>
Submitted on : Thursday, May 28, 2009 - 4:09:23 PM
Last modification on : Tuesday, July 27, 2021 - 3:54:02 PM
Long-term archiving on: : Thursday, June 10, 2010 - 11:59:48 PM

File

euro97.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00389372, version 1

Collections

CNRS | ICP | UGA

Citation

Lionel Reveret. From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction. EUROSPEECH Conference, Sep 1997, Rhodes, Greece. ⟨inria-00389372⟩

Share

Metrics

Record views

170

Files downloads

114