Conference paper, Year: 2009

LIG at TRECVID 2009: Hierarchical Fusion for High Level Feature Extraction

Abstract

In this work we investigated a hierarchical strategy for fusing the outputs of hundreds of descriptor × classifier combinations. Over one hundred descriptors gathered in the context of the IRIM consortium were used for high-level feature (HLF) detection with up to four different classifiers. The resulting classification scores were then fused to produce a single classification score for each video shot and HLF. To cope with the redundancy of the information obtained from similar descriptors and from the different classifiers applied to them, we propose a hierarchical fusion approach in which 1) each source type receives an appropriate global weight, and 2) all the descriptor × classifier combinations from a similar source type are first combined optimally before being merged at the next level. The best LIG run achieved a Mean Inferred Average Precision of 0.1276, which is significantly above the median performance for the TRECVID 2009 HLF detection task. We found that fusing the classification scores from different classifier types improves performance and that audio descriptors can help even though their individual performance is quite low.
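The two-level idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how weights are chosen or which fusion operator is used, so the weighted mean, the function names (`weighted_mean`, `hierarchical_fusion`), the source type labels, and all weights and scores below are assumptions made purely for illustration.

```python
from collections import defaultdict


def weighted_mean(score_lists, weights):
    """Weighted average of equally long score lists (one score per shot)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_shots = len(score_lists[0])
    return [
        sum(w * s[i] for w, s in zip(weights, score_lists)) / total
        for i in range(n_shots)
    ]


def hierarchical_fusion(runs, source_weights):
    """Fuse scores in two levels.

    runs: list of (source_type, weight, scores) tuples, one per
          descriptor x classifier combination (hypothetical layout).
    source_weights: dict mapping source_type to its global weight.
    """
    by_source = defaultdict(list)
    for source_type, weight, scores in runs:
        by_source[source_type].append((weight, scores))

    # Level 1: combine all combinations that share a source type.
    names = sorted(by_source)
    per_source = []
    for name in names:
        weights = [w for w, _ in by_source[name]]
        scores = [s for _, s in by_source[name]]
        per_source.append(weighted_mean(scores, weights))

    # Level 2: merge the per-source scores using the global weights.
    return weighted_mean(per_source, [source_weights[n] for n in names])


# Toy data (made up): three redundant "color" runs and one "audio" run,
# each giving a score for two video shots.
runs = [
    ("color", 1.0, [0.9, 0.1]),
    ("color", 1.0, [0.9, 0.1]),
    ("color", 1.0, [0.9, 0.1]),
    ("audio", 1.0, [0.1, 0.9]),
]

hier = hierarchical_fusion(runs, {"color": 1.0, "audio": 1.0})
flat = weighted_mean([s for _, _, s in runs], [w for _, w, _ in runs])
print("hierarchical:", hier)
print("flat:", flat)
```

In this toy case a flat average is dominated by the three redundant color runs (about 0.7 versus 0.3), whereas the hierarchical version gives each source type its own global weight first and yields about 0.5 for both shots. In practice the weights at each level would presumably be tuned on validation data, which this sketch does not attempt.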
File not deposited

Dates and versions

hal-00953859, version 1 (28-02-2014)

Identifiers

  • HAL Id: hal-00953859, version 1

Cite

Bahjat Safadi, Georges Quénot. LIG at TRECVID 2009: Hierarchical Fusion for High Level Feature Extraction. TREC Video Retrieval Evaluation workshop, 2009, Gaithersburg, MD, United States. ⟨hal-00953859⟩