Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis

Résumé

The importance of modeling speech articulation for high-quality audiovisual (AV) speech synthesis is widely acknowledged. Nevertheless, while state-of-the-art, data-driven approaches to facial animation can make use of sophisticated motion capture techniques, the animation of the intraoral articulators (viz. the tongue, jaw, and velum) typically makes use of simple rules or viseme morphing, in stark contrast to the otherwise high quality of facial modeling. Using appropriate speech production data could significantly improve the quality of articulatory animation for AV synthesis.
Fichier principal
Vignette du fichier
abstract.pdf (415.58 Ko) Télécharger le fichier
slides.pdf (6.34 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Format : Autre

Dates et versions

hal-00734464 , version 1 (22-09-2012)

Identifiants

Citer

Ingmar Steiner, Korin Richmond, Slim Ouni. Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis. 3rd International Symposium on Facial Analysis and Animation - FAA 2012, Sep 2012, Vienna, Austria. ⟨hal-00734464⟩
179 Consultations
239 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More