On the quality of an expressive audiovisual corpus: a case study of acted speech

Slim Ouni; Sara Dahmani; Vincent Colotte

Communication Dans Un Congrès Année : 2017

On the quality of an expressive audiovisual corpus: a case study of acted speech

(1, 2) , (2) , (2)

1
2

Slim Ouni

Fonction : Auteur
PersonId : 1158
IdHAL : slim-ouni
ORCID : 0000-0001-5286-7368

Laboratoire Lorrain de Recherche en Informatique et ses Applications

Speech Modeling for Facilitating Oral-Based Communication

Sara Dahmani

Fonction : Auteur
PersonId : 988945

Speech Modeling for Facilitating Oral-Based Communication

Vincent Colotte

Fonction : Auteur
PersonId : 16268
IdHAL : vincent-colotte
IdRef : 070401683

Speech Modeling for Facilitating Oral-Based Communication

Résumé

In the context of developing an expressive audiovisual speech synthesis system, the quality of the audiovisual corpus from which the 3D visual data will be extracted is important. In this paper, we present a perceptive case study on the quality of the expressiveness of a set of emotions acted by a semi-professional actor. We have analyzed the production of this actor pronouncing a set of sentences with acted emotions, during a human emotion-recognition task. We have observed different modalities: audio, real video, 3D-extracted data, as unimodal presentations and bimodal presentations (with audio). The results of this study show the necessity of such perceptive evaluation prior to further exploitation of the data for the synthesis system. The comparison of the modalities shows clearly what the emotions are, that need to be improved during production and how audio and visual components have a strong mutual influence on emotional perception.

Mots clés

Expressive audiovisual speech facial expres- sions acted speech audiovisual perception

Domaines

Autre [q-bio.OT] Intelligence artificielle [cs.AI] Sciences de l'information et de la communication Traitement du signal et de l'image [eess.SP]

Fichier principal

AVSP2017_paper_22.pdf (3.69 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Slim Ouni : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01596614

Soumis le : mercredi 27 septembre 2017-20:08:13

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : jeudi 28 décembre 2017-14:20:41

Dates et versions

hal-01596614 , version 1 (27-09-2017)

Identifiants

HAL Id : hal-01596614 , version 1

Citer

Slim Ouni, Sara Dahmani, Vincent Colotte. On the quality of an expressive audiovisual corpus: a case study of acted speech. The 14th International Conference on Auditory-Visual Speech Processing, KTH, Aug 2017, Stockholm, Sweden. ⟨hal-01596614⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD ANR CREATIV-LAB

290 Consultations

158 Téléchargements

On the quality of an expressive audiovisual corpus: a case study of acted speech

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager