Compact Video Description for Copy Detection with Precise Temporal Alignment

Matthijs Douze 1 Hervé Jégou 2 Cordelia Schmid 1 Patrick Pérez 3
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : This paper introduces a very compact yet discriminative video description, which allows example-based search in a large number of frames corresponding to thousands of hours of video. Our description extracts one descriptor per indexed video frame by aggregating a set of local descriptors. These frame descriptors are encoded using a time-aware hierarchical indexing structure. A modified temporal Hough voting scheme is used to rank the retrieved database videos and estimate segments in them that match the query. If we use a dense temporal description of the videos, matched video segments are localized with excellent precision. Experimental results on the Trecvid 2008 copy detection task and a set of 38000 videos from YouTube show that our method offers an excellent trade-off between search accuracy, efficiency and memory usage.
Type de document :
Communication dans un congrès
Kostas Daniilidis and Petros Maragos and Nikos Paragios. ECCV 2010 - European Conference on Computer Vision, Sep 2010, Heraklion, Greece. Springer-Verlag, 6311, pp.522-535, 2010, Lecture Notes in Computer Science. <http://www.springerlink.com/content/g811634570617831/>. <10.1007/978-3-642-15549-9_38>
Liste complète des métadonnées



https://hal.inria.fr/inria-00548641
Contributeur : Hervé Jégou <>
Soumis le : mardi 22 mars 2011 - 21:22:46
Dernière modification le : vendredi 13 janvier 2017 - 14:20:23
Document(s) archivé(s) le : dimanche 4 décembre 2016 - 01:45:58

Fichiers

paper_hal.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Matthijs Douze, Hervé Jégou, Cordelia Schmid, Patrick Pérez. Compact Video Description for Copy Detection with Precise Temporal Alignment. Kostas Daniilidis and Petros Maragos and Nikos Paragios. ECCV 2010 - European Conference on Computer Vision, Sep 2010, Heraklion, Greece. Springer-Verlag, 6311, pp.522-535, 2010, Lecture Notes in Computer Science. <http://www.springerlink.com/content/g811634570617831/>. <10.1007/978-3-642-15549-9_38>. <inria-00548641v3>

Partager

Métriques

Consultations de
la notice

630

Téléchargements du document

359