Memory vectors for similarity search in high-dimensional spaces

Ahmet Iscen; Teddy Furon; Vincent Gripon; Michael Rabbat; Hervé Jégou

doi:10.1109/TBDATA.2017.2677964

Article Dans Une Revue IEEE Transactions on Big Data Année : 2017

Memory vectors for similarity search in high-dimensional spaces

(1, 2) , (1) , (2, 3) , (4) , (5)

1
2
3
4
5

Ahmet Iscen

Fonction : Auteur correspondant
PersonId : 4814
IdHAL : ahmet-iscen

Connectez-vous pour contacter l'auteur

Creating and exploiting explicit links between multimedia fragments

Lab-STICC_TB_CACS_IAS

Teddy Furon

Fonction : Auteur
PersonId : 3087
IdHAL : teddy-furon
IdRef : 078044758

Creating and exploiting explicit links between multimedia fragments

Vincent Gripon

Fonction : Auteur
PersonId : 21307
IdHAL : vincent-gripon
ORCID : 0000-0002-4353-4542
IdRef : 16122203X

Lab-STICC_TB_CACS_IAS

Département Electronique

Michael Rabbat

Fonction : Auteur

Department of Electrical and Computer Engineering [Montréal]

Hervé Jégou

Fonction : Auteur
PersonId : 1003297

Facebook AI Research [Paris]

Résumé

We study an indexing architecture to store and search in a database of high-dimensional vectors from the perspective of statistical signal processing and decision theory. This architecture is composed of several memory units, each of which summarizes a fraction of the database by a single representative vector. The potential similarity of the query to one of the vectors stored in the memory unit is gauged by a simple correlation with the memory unit's representative vector. This representative optimizes the test of the following hypothesis: the query is independent from any vector in the memory unit vs. the query is a simple perturbation of one of the stored vectors. Compared to exhaustive search, our approach finds the most similar database vectors significantly faster without a noticeable reduction in search quality. Interestingly, the reduction of complexity is provably better in high-dimensional spaces. We empirically demonstrate its practical interest in a large-scale image search scenario with off-the-shelf state-of-the-art descriptors.

Mots clés

image indexing image retrieval High-dimensional indexing

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

iscen_tbd.pdf (787.47 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ahmet Iscen : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01481220

Soumis le : jeudi 2 mars 2017-12:40:32

Dernière modification le : mardi 23 janvier 2024-11:46:38

Archivage à long terme le : mercredi 31 mai 2017-13:53:37

Dates et versions

hal-01481220 , version 1 (02-03-2017)

Identifiants

HAL Id : hal-01481220 , version 1
DOI : 10.1109/TBDATA.2017.2677964

Citer

Ahmet Iscen, Teddy Furon, Vincent Gripon, Michael Rabbat, Hervé Jégou. Memory vectors for similarity search in high-dimensional spaces. IEEE Transactions on Big Data, 2017, 4 (1), pp.65 - 77. ⟨10.1109/TBDATA.2017.2677964⟩. ⟨hal-01481220⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BREST INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA ENIB LAB-STICC_ENIB LAB-STICC CENTRALESUPELEC INRIA2 LAB-STICC_TB UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES IMT-ATLANTIQUE ANR UR1-MATH-NUM

612 Consultations

328 Téléchargements

Memory vectors for similarity search in high-dimensional spaces

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager