Conference Papers Year : 2017

On the quality of an expressive audiovisual corpus: a case study of acted speech

Slim Ouni, Sara Dahmani, Vincent Colotte

Abstract

In the context of developing an expressive audiovisual speech synthesis system, the quality of the audiovisual corpus from which the 3D visual data will be extracted is important. In this paper, we present a perceptual case study of the quality of the expressiveness of a set of emotions acted by a semi-professional actor. Using a human emotion-recognition task, we analyzed this actor's production of a set of sentences spoken with acted emotions. We considered several modalities: audio, real video, and 3D-extracted data, presented both unimodally and bimodally (with audio). The results show that such a perceptual evaluation is necessary before the data are exploited further for the synthesis system. The comparison of the modalities clearly shows which emotions need to be improved during production, and how strongly the audio and visual components influence each other in emotional perception.
Main file
AVSP2017_paper_22.pdf (3.69 MB)
Origin : Files produced by the author(s)

Dates and versions

hal-01596614 , version 1 (27-09-2017)

Identifiers

  • HAL Id : hal-01596614 , version 1

Cite

Slim Ouni, Sara Dahmani, Vincent Colotte. On the quality of an expressive audiovisual corpus: a case study of acted speech. The 14th International Conference on Auditory-Visual Speech Processing, KTH, Aug 2017, Stockholm, Sweden. ⟨hal-01596614⟩
