Conference papers

From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction

Abstract : This paper presents a method for extracting articulatory parameters directly from raw images of the lips. The system architecture consists of three independent parts. First, a new greyscale mouth image is centred and downsampled. Second, the image is aligned and projected onto a basis of artificial images, namely the eigenvectors computed by a PCA applied to a set of 23 reference lip shapes. Third, a multilinear interpolation predicts the articulatory parameters from the coefficients of the image's projection onto the eigenvectors. In addition, the projection coefficients and the predicted parameters were evaluated with an HMM-based visual speech recogniser. The recognition scores obtained with this method are compared to reference scores and discussed.
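
The pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the author's implementation. The image size, the downsampling factor, the number of retained eigenvectors and the number of articulatory parameters are all assumptions chosen for illustration, and the alignment step is omitted. The final stage uses an ordinary least-squares linear map as a simple stand-in for the multilinear interpolation named in the abstract, whose exact form is not given here. All function names are hypothetical.

```python
import numpy as np

def downsample(image, factor):
    """Block-average a 2-D greyscale image by an integer factor (assumed scheme)."""
    h, w = image.shape
    h, w = h - h % factor, w - w % factor  # crop so both sides divide evenly
    blocks = image[:h, :w].reshape(h // factor, factor, w // factor, factor)
    return blocks.mean(axis=(1, 3))

def fit_pca(reference_images, n_components):
    """Return the mean image (flattened) and the leading eigenvectors as rows."""
    X = reference_images.reshape(len(reference_images), -1).astype(float)
    mean = X.mean(axis=0)
    # Right singular vectors of the centred data are the PCA eigenvectors.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(image, mean, basis):
    """Projection coefficients of one image onto the eigenvector basis."""
    return basis @ (image.ravel().astype(float) - mean)

def fit_linear_map(coeffs, params):
    """Least-squares map from coefficients to parameters.
    Stand-in for the paper's multilinear interpolation (assumption)."""
    A = np.hstack([coeffs, np.ones((len(coeffs), 1))])  # append a bias column
    W, *_ = np.linalg.lstsq(A, params, rcond=None)
    return W

def predict(coeff, W):
    """Predict articulatory parameters for one coefficient vector."""
    return np.append(coeff, 1.0) @ W

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # 23 synthetic "reference lip shapes"; real data would be viseme images.
    refs = np.stack([downsample(rng.random((32, 48)), 2) for _ in range(23)])
    ref_params = rng.random((23, 3))  # 3 made-up articulatory parameters each

    mean, basis = fit_pca(refs, n_components=5)
    coeffs = np.array([project(r, mean, basis) for r in refs])
    W = fit_linear_map(coeffs, ref_params)

    new_image = downsample(rng.random((32, 48)), 2)
    print(predict(project(new_image, mean, basis), W))
```

With 23 reference images the centred data has rank at most 22, so at most 22 eigenvectors carry information. The sketch keeps 5 only to keep the example small.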

https://hal.inria.fr/inria-00389372
Contributor : Lionel Reveret
Submitted on : Thursday, May 28, 2009 - 4:09:23 PM
Last modification on : Thursday, January 11, 2018 - 6:15:18 AM
Document(s) archived on : Thursday, June 10, 2010 - 11:59:48 PM

File

euro97.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00389372, version 1

Collections

CNRS | ICP | UGA

Citation

Lionel Reveret. From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction. EUROSPEECH Conference, Sep 1997, Rhodes, Greece. ⟨inria-00389372⟩
