Video Story Segmentation with Multi-Modal Features: Experiments on TRECvid 2003

Laurent Besacier; Georges Quénot; Stéphane Ayache; Daniel Moraru

Communication Dans Un Congrès Année : 2004

Video Story Segmentation with Multi-Modal Features: Experiments on TRECvid 2003

(1) , (2) , (3) , (4)

1
2
3
4

Laurent Besacier

Fonction : Auteur
PersonId : 1521
IdHAL : laurent-besacier
ORCID : 0000-0001-7411-9125
IdRef : 079377017

Laboratoire d'Informatique de Grenoble

Georges Quénot

Fonction : Auteur
PersonId : 3114
IdHAL : georges-quenot
ORCID : 0000-0003-2117-247X
IdRef : 034104518

Modélisation et Recherche d’Information Multimédia [Grenoble]

Stéphane Ayache

Fonction : Auteur
PersonId : 16733
IdHAL : stephane-ayache
ORCID : 0000-0003-2982-7127
IdRef : 129313254

Laboratoire d'informatique Fondamentale de Marseille - UMR 6166

Daniel Moraru

Fonction : Auteur

Equipe GEOD, Groupe d'étude sur l'oral et le dialogue

Résumé

This paper describes the first steps of CLIPS/IMAG on the TREC video story segmentation task. We mostly describe the multi-modal features used and their respective performance for the story segmentation task. These features are based on the audio, video and text modalities. The preliminary system, which has the advantage to be relatively free with respect to the use of training data, is also presented in this paper. First experiments on the TRECVID 2003 evaluation set lead to a recall rate of 0.613 and a precision rate of 0.467.

Mots clés

Story segmentation TRECVID Multi-modal

Domaines

Recherche d'information [cs.IR]

Marie-Christine Fauvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00953929

Soumis le : vendredi 28 février 2014-16:06:52

Dernière modification le : jeudi 4 avril 2024-18:27:38

Dates et versions

hal-00953929 , version 1 (28-02-2014)

Identifiants

HAL Id : hal-00953929 , version 1

Citer

Laurent Besacier, Georges Quénot, Stéphane Ayache, Daniel Moraru. Video Story Segmentation with Multi-Modal Features: Experiments on TRECvid 2003. 6th ACM SIGMM International Workshop on Multimedia Information Retrieval (MIR'04), 2004, New York, NY, United States. ⟨hal-00953929⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA LIF CNRS UNIV-AMU LIG LIG_TDCGE LIG_TDCGE_MRIM LIS-LAB POLYTECH-GRENOBLE LIG_SIDCH

217 Consultations

0 Téléchargements

Video Story Segmentation with Multi-Modal Features: Experiments on TRECvid 2003

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager