CLIPS at TRECvid: Shot Boundary Detection and Feature Detection

Georges Quénot 1 Daniel Moraru 2 Laurent Besacier 3
1 MRIM - Modélisation et Recherche d’Information Multimédia [Grenoble]
LIG - Laboratoire d'Informatique de Grenoble, Inria - Institut National de Recherche en Informatique et en Automatique
2 CLIPS-IMAG - Equipe GEOD, Groupe d'étude sur l'oral et le dialogue
LIG - Laboratoire d'Informatique de Grenoble
Abstract : This paper presents the systems used by CLIPS-IMAG for the Shot Boundary Detection (SBD) task and the Feature Extraction (FE) task of the TRECvid workshop, together with the results obtained in the 2003 evaluation. The CLIPS SBD system, based on image difference with motion compensation and direct dissolve detection, ranked second among 14 systems. It allows the silence-to-noise ratio to be controlled over a wide range of values; at the point where noise and silence (or recall and precision) are equal, the value is 12% for all types of transitions. Detection of person X from speaker recognition alone was disappointing because of the small number of shots containing person X in the overall test collection (about 1/700) and the even smaller number in which person X was actually speaking (about 1/6000). Detection of person X from speech transcription performed much better but still fell short of other systems that also used the image track for detection.
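The relation between noise, silence, recall and precision mentioned in the abstract can be illustrated with a minimal sketch. It assumes the usual information retrieval definitions (silence = 1 - recall, noise = 1 - precision), which the abstract does not spell out. The function name and the counts are hypothetical, chosen only to show an operating point where both rates are equal; they are not results from the paper.

```python
def silence_and_noise(num_correct, num_reference, num_detected):
    """Return (silence, noise) for a set of detected transitions.

    Assumed definitions (not stated in the abstract):
      silence = 1 - recall    = share of reference transitions that were missed
      noise   = 1 - precision = share of detected transitions that are false alarms
    """
    if num_reference <= 0 or num_detected <= 0:
        raise ValueError("reference and detected counts must be positive")
    recall = num_correct / num_reference
    precision = num_correct / num_detected
    return 1.0 - recall, 1.0 - precision


# Hypothetical counts: 100 reference transitions, 100 detections, 88 correct.
silence, noise = silence_and_noise(num_correct=88, num_reference=100, num_detected=100)
print(f"silence = {silence:.2f}, noise = {noise:.2f}")
```

With equal numbers of reference and detected transitions, recall and precision coincide, so silence and noise coincide as well. Raising or lowering a detection threshold changes the number of detections and trades one rate against the other.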
Document type :
Conference papers

https://hal.inria.fr/hal-00953934
Contributor : Marie-Christine Fauvet
Submitted on : Friday, February 28, 2014 - 4:07:07 PM
Last modification on : Monday, July 8, 2019 - 3:09:49 PM

Identifiers

  • HAL Id : hal-00953934, version 1

Citation

Georges Quénot, Daniel Moraru, Laurent Besacier. CLIPS at TRECvid: Shot Boundary Detection and Feature Detection. TRECVID'2003 Workshop, 2003, Gaithersburg, MD, United States. ⟨hal-00953934⟩
