Audiovisual Generation of Social Attitudes from Neutral Stimuli

The focus of this study is the generation of expressive audiovisual speech from neutral utterances for 3D virtual actors. Taking into account the segmental and suprasegmental aspects of audiovisual speech, we propose and compare several computational frameworks for the generation of expressive speech and face animation. We notably evaluate a standard frame-based conversion approach with two other methods that postulate the existence of global prosodic audiovisual patterns that are characteristic of social attitudes. The proposed approaches are tested on a database of " Exercises in Style " [1] performed by two semi-professional actors and results are evaluated using crowdsourced perceptual tests. The first test performs a qualitative validation of the animation platform while the second is a comparative study between several expressive speech generation methods. We evaluate how the expressiveness of our audiovisual performances is perceived in comparison to resynthesized original utterances and the outputs of a purely frame-based conversion system.

Mots clés

Virtual actors expressive speech animation audiovisual prosody GMM superposition of functional contours

Domaines

Synthèse d'image et réalité virtuelle [cs.GR] Traitement du signal et de l'image [eess.SP] Multimédia [cs.MM] Réseau de neurones [cs.NE]

Fichier principal

audiovisual-generation-avsp-2015.pdf (421.27 Ko)

audiovisual_generation_of_social_attitudes_from_neural_stimuli.jpeg (33.2 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Figure, Image
Origine : Fichiers produits par l'(les) auteur(s)

Rémi Ronfard : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01178056

Soumis le : mardi 22 décembre 2015-21:26:02

Dernière modification le : jeudi 4 avril 2024-21:28:56

Archivage à long terme le : dimanche 30 avril 2017-00:09:16

Dates et versions

hal-01178056 , version 1 (22-12-2015)

Identifiants

HAL Id : hal-01178056 , version 1

Citer

Adela Barbulescu, Gérard Bailly, Rémi Ronfard, Maël Pouget. Audiovisual Generation of Social Attitudes from Neutral Stimuli. FAAVSP 2015 - 1st Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing, Sep 2015, Vienne, Austria. pp.34-39. ⟨hal-01178056⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA GIPSA GIPSA-DPC LJK LJK_GI LJK_GI_IMAGINE PERSYVAL-LAB GIPSA-CRISSP INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

554 Consultations

252 Téléchargements