Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis

Abstract : The importance of modeling speech articulation for high-quality audiovisual (AV) speech synthesis is widely acknowledged. Nevertheless, while state-of-the-art, data-driven approaches to facial animation can make use of sophisticated motion capture techniques, the animation of the intraoral articulators (viz. the tongue, jaw, and velum) typically makes use of simple rules or viseme morphing, in stark contrast to the otherwise high quality of facial modeling. Using appropriate speech production data could significantly improve the quality of articulatory animation for AV synthesis.
Type de document :
Communication dans un congrès
3rd International Symposium on Facial Analysis and Animation - FAA 2012, Sep 2012, Vienna, Austria. 2012
Liste complète des métadonnées

https://hal.inria.fr/hal-00734464
Contributeur : Ingmar Steiner <>
Soumis le : samedi 22 septembre 2012 - 00:16:12
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24
Document(s) archivé(s) le : vendredi 16 décembre 2016 - 15:16:05

Fichiers

Identifiants

  • HAL Id : hal-00734464, version 1
  • ARXIV : 1209.4982

Collections

Citation

Ingmar Steiner, Korin Richmond, Slim Ouni. Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis. 3rd International Symposium on Facial Analysis and Animation - FAA 2012, Sep 2012, Vienna, Austria. 2012. 〈hal-00734464〉

Partager

Métriques

Consultations de la notice

359

Téléchargements de fichiers

179