Preprint, Working Paper. Year: 2016

DEEP FEATURES FOR MULTIMODAL EMOTION CLASSIFICATION

Abstract

Understanding the human emotion elicited when perceiving audiovisual content is an exciting and important research avenue, and there have recently been emerging attempts to predict the emotion elicited by video clips or movies. While most existing approaches either focus on a single modality, i.e., exploit only audio or visual data, or build on a multimodal scheme with late fusion, we propose a multimodal framework with an early fusion scheme and target an emotion classification task. Our proposed mechanism offers the advantages of handling (1) the variation in video length, (2) the imbalance of audio and visual feature sizes, and (3) the middle-level fusion of audio and visual information, such that a higher-level feature representation can be learned jointly from the two modalities for classification. We evaluate the performance of the proposed approach on an international benchmark, the MediaEval 2015 Affective Impact of Movies task, and show that it outperforms most state-of-the-art systems on arousal accuracy while using a much smaller feature size.
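The abstract's three design points can be illustrated with a short sketch of an early (middle-level) fusion classifier. The following PyTorch snippet is a minimal illustration under stated assumptions, not the authors' implementation: the feature dimensions, layer widths, mean-pooling over time, and the three-way arousal labels are all assumptions made for the example.

```python
# Minimal sketch of early (middle-level) audio-visual fusion, assuming PyTorch.
# All sizes are illustrative; this is not the paper's exact architecture.
import torch
import torch.nn as nn

class EarlyFusionClassifier(nn.Module):
    def __init__(self, audio_dim=64, visual_dim=4096, fused_dim=256, n_classes=3):
        super().__init__()
        # Project each modality to a common size, countering the imbalance
        # between audio and visual feature dimensions before fusing.
        self.audio_proj = nn.Linear(audio_dim, fused_dim)
        self.visual_proj = nn.Linear(visual_dim, fused_dim)
        # A joint layer learns a higher-level representation from the
        # concatenated modalities, then classifies.
        self.joint = nn.Sequential(
            nn.Linear(2 * fused_dim, fused_dim), nn.ReLU(),
            nn.Linear(fused_dim, n_classes),
        )

    def forward(self, audio, visual):
        # audio: (batch, T_a, audio_dim); visual: (batch, T_v, visual_dim).
        # Mean-pooling over the time axis yields one fixed-size vector per
        # clip, so videos of different lengths share the same classifier.
        a = self.audio_proj(audio.mean(dim=1))
        v = self.visual_proj(visual.mean(dim=1))
        return self.joint(torch.cat([a, v], dim=-1))

model = EarlyFusionClassifier()
logits = model(torch.randn(2, 100, 64), torch.randn(2, 30, 4096))
print(logits.shape)  # (2, 3): one score per hypothetical arousal class
```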
Main file: ICIPpaper.pdf (268.29 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01289191, version 1 (16-03-2016)

Identifiers

  • HAL Id: hal-01289191, version 1

Cite

Shriman Narayan Tiwari, Ngoc Q. K. Duong, Frédéric Lefebvre, Claire-Hélène Demarty, Benoit Huet, et al. DEEP FEATURES FOR MULTIMODAL EMOTION CLASSIFICATION. 2016. ⟨hal-01289191⟩

Collections

EURECOM
267 views
401 downloads
