Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus

Cédric Fayet; Arnaud Delhay; Damien Lolive; Pierre-François Marteau

Communication Dans Un Congrès Année : 2017

Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus

(1) , (1) , (1) , (1)

Cédric Fayet

Fonction : Auteur
PersonId : 1026701

Expressiveness in Human Centered Data/Media

Arnaud Delhay

Fonction : Auteur
PersonId : 5448
IdHAL : arnaud-delhay
ORCID : 0000-0001-6795-7999
IdRef : 122406354

Expressiveness in Human Centered Data/Media

Damien Lolive

Fonction : Auteur
PersonId : 5088
IdHAL : damien-lolive
ORCID : 0000-0002-1110-5444
IdRef : 13017498X

Expressiveness in Human Centered Data/Media

Pierre-François Marteau

Fonction : Auteur
PersonId : 219
IdHAL : pierre-francois-marteau
ORCID : 0000-0002-3963-8795
IdRef : 033981124

Expressiveness in Human Centered Data/Media

Résumé

This paper presents an attempt to evaluate three different sets of features extracted from prosodic descriptors and Big Five traits for building an anomaly detector. The Big Five model enables to capture personality information. Big Five traits are extracted from a manual annotation while Prosodic features are extracted directly from the speech signal. Two different anomaly detection methods are evaluated: Gaussian Mixture Model (GMM) and One-Class SVM (OC-SVM), each one combined with a threshold classification to decide the ”normality” of a sample. The different combinations of models and feature sets are evaluated on the SSPNET-Personality corpus which has already been used in several experiments, including a previous work on separating two types of personality profiles in a supervised way. In this work, we propose the above mentioned unsupervised or semi-supervised methods, and discuss their performance, to detect particular audio-clips produced by a speaker with an abnormal personality. Results show that using automatically extracted prosodic features competes with the Big Five traits. The overall detection performance achieved by the best model is around 0.8 (F1-measure)

Mots clés

Anomaly detection Gaussian Mixture Model One Class-Support Vector Machine Threshold Classification Social Signal Big Five Prosody SSPNET-Personality.

Domaines

Informatique et langage [cs.CL] Apprentissage [cs.LG]

Expression Irisa : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01583510

Soumis le : jeudi 7 septembre 2017-14:17:54

Dernière modification le : mardi 3 octobre 2023-09:49:45

Dates et versions

hal-01583510 , version 1 (07-09-2017)

Identifiants

HAL Id : hal-01583510 , version 1

Citer

Cédric Fayet, Arnaud Delhay, Damien Lolive, Pierre-François Marteau. Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus. Interspeech, Aug 2017, Stockholm, Sweden. ⟨hal-01583510⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES ENSSAT IRISA IRISA-D6 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

254 Consultations

0 Téléchargements

Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager