On the quality of an expressive audiovisual corpus: a case study of acted speech - Inria - Institut national de recherche en sciences et technologies du numérique
Conference paper, Year: 2017

On the quality of an expressive audiovisual corpus: a case study of acted speech

Abstract

In the context of developing an expressive audiovisual speech synthesis system, the quality of the audiovisual corpus from which the 3D visual data will be extracted is important. In this paper, we present a perceptual case study of how well a semi-professional actor expresses a set of acted emotions. We analyzed the actor's production of a set of sentences spoken with acted emotions by means of a human emotion-recognition task. We examined several modalities: audio, real video, and 3D-extracted data, presented both unimodally and bimodally (combined with audio). The results show that such a perceptual evaluation is necessary before the data are exploited further for the synthesis system. Comparing the modalities clearly shows which emotions need to be improved during production, and how strongly the audio and visual components influence each other in emotional perception.
Main file
AVSP2017_paper_22.pdf (3.69 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-01596614, version 1 (27-09-2017)

Identifiers

  • HAL Id: hal-01596614, version 1

Cite

Slim Ouni, Sara Dahmani, Vincent Colotte. On the quality of an expressive audiovisual corpus: a case study of acted speech. The 14th International Conference on Auditory-Visual Speech Processing, KTH, Aug 2017, Stockholm, Sweden. ⟨hal-01596614⟩
