Large Scale Metric Learning for Distance-Based Image Classification

Thomas Mensink 1, 2, * Jakob Verbeek 1 Florent Perronnin 2 Gabriela Csurka 2
* Corresponding author
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : This paper studies large-scale image classification, in a setting where new classes and training images could continuously be added at (near) zero cost. We cast this problem into one of learning a low-rank metric, which is shared across all classes and explore k-nearest neighbor (k-NN) and nearest class mean (NCM) classifiers. We also introduce an extension of the NCM classifier to allow for richer image representations. Experiments on the ImageNet 2010 challenge dataset ---which contains more than 1M training images of 1K classes--- shows, surprisingly, that the NCM classifier compares favorably to the more flexible k-NN classifier. Moreover, the NCM performance suggests that 256 dimensional features is comparable to that of linear SVMs, which were used to obtain the current state-of-the-art performance. Experimentally we study the generalization performance to classes that were not used to learn the metrics, and show how a zero-shot model based on the ImageNet hierarchy can be combined effectively with small training datasets. Using a metric learned on 1K classes, we show results for the ImageNet-10K dataset, and obtain performance that is competitive with the current state-of-the-art, while requiring significant less training time.
Liste complète des métadonnées

Cited literature [40 references]  Display  Hide  Download


https://hal.inria.fr/hal-00735908
Contributor : Thoth Team <>
Submitted on : Thursday, September 27, 2012 - 5:58:53 PM
Last modification on : Tuesday, February 12, 2019 - 10:30:05 AM
Document(s) archivé(s) le : Friday, December 16, 2016 - 6:12:10 PM

Files

RR-8077.pdf
Explicit agreement for this submission

Identifiers

  • HAL Id : hal-00735908, version 1

Collections

Citation

Thomas Mensink, Jakob Verbeek, Florent Perronnin, Gabriela Csurka. Large Scale Metric Learning for Distance-Based Image Classification. [Research Report] RR-8077, INRIA. 2012, pp.30. ⟨hal-00735908⟩

Share

Metrics

Record views

1500

Files downloads

1560