From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction

Abstract: This paper presents a method for extracting articulatory parameters by directly processing raw images of the lips. The system architecture consists of three independent parts. First, a new greyscale mouth image is centred and downsampled. Second, the image is aligned and projected onto a basis of artificial images. These images are the eigenvectors computed by a PCA applied to a set of 23 reference lip shapes. Third, a multilinear interpolation predicts articulatory parameters from the coefficients of the image projection onto the eigenvectors. In addition, the projection coefficients and the predicted parameters were evaluated with an HMM-based visual speech recogniser. Recognition scores obtained with our method are compared to reference scores and discussed.
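
The three stages named in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: all function names, the image size and the number of retained eigenvectors are assumptions made for illustration, the centring and alignment steps are omitted, and an affine least-squares map is swapped in for the multilinear interpolation, whose details the abstract does not give.

```python
import numpy as np


def downsample(image, factor):
    """Block-average a 2-D greyscale image by an integer factor."""
    h, w = image.shape
    h, w = h - h % factor, w - w % factor  # crop to a multiple of factor
    blocks = image[:h, :w].reshape(h // factor, factor, w // factor, factor)
    return blocks.mean(axis=(1, 3))


def fit_pca_basis(reference_images, n_components):
    """Return the mean image vector and the leading PCA eigenvectors (as rows)."""
    X = np.stack([img.ravel() for img in reference_images]).astype(float)
    mean = X.mean(axis=0)
    # The rows of vt are the eigenvectors of the sample covariance matrix.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def project(image, mean, basis):
    """Projection coefficients of one image onto the eigenvector basis."""
    return basis @ (image.ravel().astype(float) - mean)


def fit_affine_map(coeffs, params):
    """Least-squares affine map from coefficients to articulatory parameters.

    Assumption: a stand-in for the paper's multilinear interpolation.
    """
    A = np.hstack([coeffs, np.ones((len(coeffs), 1))])  # append a bias column
    W, *_ = np.linalg.lstsq(A, params, rcond=None)
    return W


def predict(coeff, W):
    """Predict articulatory parameters for one coefficient vector."""
    return np.append(coeff, 1.0) @ W
```

In this sketch, the 23 reference lip shapes would be downsampled with `downsample`, passed to `fit_pca_basis`, and their coefficients paired with known articulatory parameters in `fit_affine_map`; a new image is then handled by `downsample`, `project` and `predict` in turn.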
Document type:
Conference paper
EUROSPEECH Conference, Sep 1997, Rhodes, Greece. 1997

https://hal.inria.fr/inria-00389372
Contributor: Lionel Reveret
Submitted on: Thursday, 28 May 2009 - 16:09:23
Last modified on: Thursday, 11 January 2018 - 06:15:18
Document(s) archived on: Thursday, 10 June 2010 - 23:59:48

File

euro97.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00389372, version 1

Collections

ICP | UGA

Citation

Lionel Reveret. From Raw Images of the Lips to Articulatory Parameters : A Viseme-based Prediction. EUROSPEECH Conference, Sep 1997, Rhodes, Greece. 1997. 〈inria-00389372〉
