Beyond "project and sign" for cosine estimation with binary codes

Raghavendran Balu 1 Teddy Furon 1 Hervé Jégou 1
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Many nearest neighbor search algorithms rely on encoding real vectors into binary vectors. The most common strategy projects the vectors onto random directions and takes the sign to produce so-called sketches. This paper discusses the sub-optimality of this choice, and proposes a better encoding strategy based on the quantization and reconstruction points of view. Our second contribution is a novel asymmetric estimator for the cosine similarity. Similar to previous asymmetric schemes, the query is not quantized and the similarity is computed in the compressed domain. Both our contribution leads to improve the quality of nearest neighbor search with binary codes. Its efficiency compares favorably against a recent encoding technique.
Document type :
Conference papers
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download


https://hal.inria.fr/hal-00942075
Contributor : Hervé Jégou <>
Submitted on : Tuesday, February 4, 2014 - 4:13:04 PM
Last modification on : Friday, November 16, 2018 - 1:24:26 AM
Long-term archiving on : Sunday, April 9, 2017 - 8:47:47 AM

Files

icassp_beyond_project_sign.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00942075, version 1

Citation

Raghavendran Balu, Teddy Furon, Hervé Jégou. Beyond "project and sign" for cosine estimation with binary codes. ICASPP - International Conference on Acoustics, Speech, and Signal Processing, IEEE, May 2014, Florence, Italy. ⟨hal-00942075⟩

Share

Metrics

Record views

755

Files downloads

543