From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 1997

From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction

Lionel Reveret

Résumé

This paper presents a method for the extraction of articulatory parameters from direct processing of raw images of the lips. The system architecture is made of three independent parts. First, a new greyscale mouth image is centred and downsampled. Second, the image is aligned and projected onto a basis of artificial images. These images are the eigenvectors computed from a PCA applied on a set of 23 reference lip shapes. Then, a multilinear interpolation predicts articulatory parameters from the image projection coefficients onto the eigenvectors. In addition, the projection coefficients and the predicted parameters were evaluated by an HMMbased visual speech recogniser. Recognition scores obtained with our method are compared to reference scores and discussed.
Fichier principal
Vignette du fichier
euro97.pdf (164.03 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00389372 , version 1 (28-05-2009)

Identifiants

  • HAL Id : inria-00389372 , version 1

Citer

Lionel Reveret. From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction. EUROSPEECH Conference, Sep 1997, Rhodes, Greece. ⟨inria-00389372⟩
116 Consultations
87 Téléchargements

Partager

Gmail Facebook X LinkedIn More