MULTIMODAL INFORMATION FUSION AND TEMPORAL INTEGRATION FOR VIOLENCE DETECTION IN MOVIES

Cédric Penet 1,2, Claire-Hélène Demarty 1, Guillaume Gravier 2, Patrick Gros 2
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: This paper presents a violent shot detection system and studies several methods for introducing temporal and multimodal information into the framework. It also investigates different Bayesian network structure learning algorithms for modelling these problems. The system is trained and tested on the MediaEval 2011 Affect Task corpus, which comprises 15 Hollywood movies. Experiments show that both multimodality and temporality contribute useful information to the system. Moreover, analysing the links between the variables of the resulting graphs yields important observations about the quality of the structure learning algorithms. Overall, our best system achieved a 50% false alarm rate and a 3% missed detection rate, which ranks among the best submissions to the MediaEval campaign.
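
As an illustration only (not the authors' implementation), the sketch below shows the general idea described in the abstract: learning a Bayesian network structure over discretised per-shot features from both the audio and video modalities, plus a previous-shot label standing in for temporal context, and then querying the network to classify a new shot as violent or not. The feature names, the toy data, the discretisation, and the use of the pgmpy library (a recent version) are all assumptions made for the example.

```python
# Illustrative sketch: Bayesian network structure learning for violent-shot
# classification with multimodal (audio + video) and temporal features.
# Feature names and data are hypothetical placeholders, not the paper's features.
import pandas as pd
from pgmpy.estimators import HillClimbSearch, K2Score, MaximumLikelihoodEstimator
from pgmpy.models import BayesianNetwork
from pgmpy.inference import VariableElimination

# Hypothetical per-shot features, already discretised (pgmpy's structure
# estimators expect discrete variables).
train = pd.DataFrame({
    "audio_energy":  [2, 0, 1, 2, 0, 1, 2, 1],  # audio modality
    "shot_length":   [0, 2, 1, 0, 2, 1, 0, 1],  # video modality
    "activity":      [2, 0, 1, 2, 0, 0, 2, 1],  # video modality
    "prev_violence": [0, 0, 0, 1, 0, 0, 1, 0],  # temporal context: previous shot label
    "violence":      [1, 0, 0, 1, 0, 0, 1, 0],  # target label
})

# Learn the graph structure from data (hill climbing with a K2 score),
# keep all variables as nodes, then fit the conditional probability tables
# by maximum likelihood.
structure = HillClimbSearch(train).estimate(scoring_method=K2Score(train))
model = BayesianNetwork(structure.edges())
model.add_nodes_from(train.columns)
model.fit(train, estimator=MaximumLikelihoodEstimator)

# Classify a new shot by querying P(violence | observed features).
infer = VariableElimination(model)
posterior = infer.query(
    variables=["violence"],
    evidence={"audio_energy": 2, "shot_length": 0, "activity": 2, "prev_violence": 1},
)
print(posterior)
```

In this toy setup the structure learner decides for itself how the audio, video, and temporal variables connect to the violence label, which mirrors the abstract's point that inspecting the learned links reveals how much each modality and the temporal context contribute.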
Document type: Conference paper

Cited literature: 8 references

https://hal.inria.fr/hal-00671016
Contributor: Cédric Penet
Submitted on: Thursday, February 16, 2012 - 2:42:38 PM
Last modification on: Friday, November 16, 2018 - 1:22:33 AM
Long-term archiving on: Thursday, June 14, 2012 - 4:35:39 PM

Identifiers

  • HAL Id: hal-00671016, version 1

Citation

Cédric Penet, Claire-Hélène Demarty, Guillaume Gravier, Patrick Gros. MULTIMODAL INFORMATION FUSION AND TEMPORAL INTEGRATION FOR VIOLENCE DETECTION IN MOVIES. ICASSP - 37th International Conference on Acoustics, Speech, and Signal Processing (2012), Mar 2012, Kyoto, Japan. ⟨hal-00671016⟩
