Aggregating local image descriptors into compact codes

Hervé Jégou; Florent Perronnin; Matthijs Douze; Jorge Sánchez; Patrick Pérez; Cordelia Schmid

doi:10.1109/TPAMI.2011.235

Journal Articles IEEE Transactions on Pattern Analysis and Machine Intelligence Year : 2012

Aggregating local image descriptors into compact codes

(1) , (2) , (3, 4) , (2) , (5) , (3)

1
2
3
4
5

Hervé Jégou

Function : Author
PersonId : 833473

Multimedia content-based indexing

Florent Perronnin

Function : Author

Xerox Research Centre Europe [Meylan]

Matthijs Douze

Function : Author
PersonId : 843109

Learning and recognition in vision

Service Expérimentation et Développement

Jorge Sánchez

Function : Author

Xerox Research Centre Europe [Meylan]

Patrick Pérez

Function : Author
PersonId : 1022281

Technicolor R & I [Cesson Sévigné]

Cordelia Schmid

Function : Author
PersonId : 831154

Learning and recognition in vision

Abstract

This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image dataset takes about 250 ms on one processor core.

Keywords

image search image retrieval indexing

Domains

Computer Vision and Pattern Recognition [cs.CV]

Fichier principal

jegou_aggregate.pdf (696.72 Ko)

teaser.jpg (15.64 Ko)

Origin : Files produced by the author(s)

Format : Figure, Image

Hervé Jégou : Connect in order to contact the contributor

https://inria.hal.science/inria-00633013

Submitted on : Monday, October 17, 2011-2:18:08 PM

Last modification on : Thursday, April 4, 2024-9:11:40 PM

Long-term archiving on: Thursday, November 15, 2012-9:50:17 AM

Dates and versions

inria-00633013 , version 1 (17-10-2011)

Identifiers

HAL Id : inria-00633013 , version 1
DOI : 10.1109/TPAMI.2011.235

Cite

Hervé Jégou, Florent Perronnin, Matthijs Douze, Jorge Sánchez, Patrick Pérez, et al.. Aggregating local image descriptors into compact codes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34 (9), pp.1704-1716. ⟨10.1109/TPAMI.2011.235⟩. ⟨inria-00633013⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 UGA CNRS INRIA INSA-RENNES IRISA LJK LJK_GI LJK_GI_LEAR IRISA-D6 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE ANR UR1-MATH-NUM

4195 View

18203 Download

Aggregating local image descriptors into compact codes

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share