Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus

Cédric Fayet 1 Arnaud Delhay 1 Damien Lolive 1 Pierre-François Marteau 1
1 EXPRESSION - Expressiveness in Human Centered Data/Media
UBS - Université de Bretagne Sud, IRISA-D6 - MEDIA ET INTERACTIONS
Abstract : This paper presents an attempt to evaluate three different sets of features extracted from prosodic descriptors and Big Five traits for building an anomaly detector. The Big Five model enables to capture personality information. Big Five traits are extracted from a manual annotation while Prosodic features are extracted directly from the speech signal. Two different anomaly detection methods are evaluated: Gaussian Mixture Model (GMM) and One-Class SVM (OC-SVM), each one combined with a threshold classification to decide the ”normality” of a sample. The different combinations of models and feature sets are evaluated on the SSPNET-Personality corpus which has already been used in several experiments, including a previous work on separating two types of personality profiles in a supervised way. In this work, we propose the above mentioned unsupervised or semi-supervised methods, and discuss their performance, to detect particular audio-clips produced by a speaker with an abnormal personality. Results show that using automatically extracted prosodic features competes with the Big Five traits. The overall detection performance achieved by the best model is around 0.8 (F1-measure)
Complete list of metadatas

https://hal.inria.fr/hal-01583510
Contributor : Expression Irisa <>
Submitted on : Thursday, September 7, 2017 - 2:17:54 PM
Last modification on : Friday, January 11, 2019 - 4:23:38 PM

Identifiers

  • HAL Id : hal-01583510, version 1

Citation

Cédric Fayet, Arnaud Delhay, Damien Lolive, Pierre-François Marteau. Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus. Interspeech, Aug 2017, Stockholm, Sweden. ⟨hal-01583510⟩

Share

Metrics

Record views

386