Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis

Ingmar Steiner; Korin Richmond; Slim Ouni

Conference paper, Year: 2012

Ingmar Steiner

Role: Author
PersonId : 917938

School of Computer Science and Informatics [Dublin]

Trinity College Dublin

Analysis, perception and recognition of speech

Korin Richmond

Role: Author

The Centre for Speech Technology Research [Edinburgh]

Slim Ouni

Role: Author
PersonId : 1158
IdHAL : slim-ouni
ORCID : 0000-0001-5286-7368

Analysis, perception and recognition of speech

Abstract

The importance of modeling speech articulation for high-quality audiovisual (AV) speech synthesis is widely acknowledged. Nevertheless, while state-of-the-art, data-driven approaches to facial animation can make use of sophisticated motion capture techniques, the animation of the intraoral articulators (viz. the tongue, jaw, and velum) typically makes use of simple rules or viseme morphing, in stark contrast to the otherwise high quality of facial modeling. Using appropriate speech production data could significantly improve the quality of articulatory animation for AV synthesis.

Domains

Human-Computer Interaction [cs.HC]; Medical imaging; Image synthesis and virtual reality [cs.GR]

Main file

abstract.pdf (415.58 KB)

slides.pdf (6.34 MB)

Origin: Files produced by the author(s)

Format: Other

https://inria.hal.science/hal-00734464

Submitted on: Saturday, September 22, 2012, 00:16:12

Last modified on: Thursday, March 7, 2024, 12:32:05

Long-term archiving on: Friday, December 16, 2016, 15:16:05

Dates and versions

hal-00734464 , version 1 (22-09-2012)

Identifiers

HAL Id : hal-00734464 , version 1
ARXIV : 1209.4982

Cite

Ingmar Steiner, Korin Richmond, Slim Ouni. Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis. 3rd International Symposium on Facial Analysis and Animation - FAA 2012, Sep 2012, Vienna, Austria. ⟨hal-00734464⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

179 views

239 downloads
