Multimodal Violence Detection in Hollywood Movies: State-of-the-Art and Benchmarking

Abstract : This chapter introduces a benchmark evaluation targeting the detection of violent scenes in Hollywood movies. The evaluation was implemented in 2011 and 2012 as an affect task in the framework of the international MediaEval benchmark initiative. We report on these 2 years of evaluation, providing a detailed description of the dataset created, describing the state of the art by studying the results achieved by participants and providing a detailed analysis of two of the best performing multimodal systems. We elaborate on the lessons learned after 2 years to provide insights on future work emphasizing multimodal modeling and fusion.
Document type :
Book sections
Complete list of metadatas

https://hal.inria.fr/hal-00968515
Contributor : Cédric Penet <>
Submitted on : Tuesday, April 1, 2014 - 10:21:26 AM
Last modification on : Friday, November 16, 2018 - 1:23:13 AM

Identifiers

Citation

Claire-Hélène Demarty, Cédric Penet, Bogdan Ionescu, Guillaume Gravier, Mohammad Soleymani. Multimodal Violence Detection in Hollywood Movies: State-of-the-Art and Benchmarking. Ionescu, Bogdan and Benois-Pineau, Jenny and Piatrik, Tomas and Quénot, Georges. Fusion in Computer Vision, Springer International Publishing, pp.185-208, 2014, Advances in Computer Vision and Pattern Recognition, ⟨10.1007/978-3-319-05696-8_8⟩. ⟨hal-00968515⟩

Share

Metrics

Record views

1096