hal-00661581, version 1
Babaz: a large scale audio search system for video copy detection
Hervé Jégou (1)
Jonathan Delhumeau (1)
Jiangbo Yuan (a, 1, 2)
Guillaume Gravier (b, 1)
Patrick Gros (c, 1)
ICASSP - 37th International Conference on Acoustics, Speech, and Signal Processing (2012)
Abstract: This paper presents Babaz, an audio search system for finding modified segments in large databases of music or video tracks. It is based on an efficient audio feature matching system that exploits reciprocal nearest neighbors to produce a per-match similarity score. Temporal consistency is taken into account based on the audio matches, and boundary estimation allows precise localization of the matching segments. The method is mainly intended for retrieving videos based on their audio track, as typically evaluated in the copy detection task of the Trecvid evaluation campaigns. Our evaluation on music retrieval shows that the system is comparable to a reference audio fingerprinting system, and our experiments on the dataset used in the copy detection task of the Trecvid'2010 campaign show that it significantly outperforms that reference system on audio-based video retrieval.
- a – Florida State University
- b – CNRS
- c – INRIA
- 1: TEXMEX (INRIA - IRISA), CNRS: UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- 2: Florida State University (FSU)
- Domain: Computer Science / Computer Vision and Pattern Recognition
- http://hal.inria.fr/hal-00661581
- oai:hal.inria.fr:hal-00661581
- Contributor: Hervé Jégou
- Submitted on: Friday, 20 January 2012, 10:15:58
- Last modified on: Friday, 20 January 2012, 11:21:31





