Image Retrieval using Textual Cues

Anand Mishra 1 Karteek Alahari 2, 3, 4 C.V. Jawahar 1
2 WILLOW - Models of visual object recognition and scene understanding
DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
4 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : We present an approach for the text-to-image retrieval problem based on textual content present in images. Given the recent developments in understanding text in images, an appealing approach to address this problem is to localize and recognize the text, and then query the database, as in a text retrieval problem. We show that such an approach, despite being based on state-of-the-art methods, is insufficient, and propose a method, where we do not rely on an exact localization and recognition pipeline. We take a query-driven search approach, where we find approximate locations of characters in the text query, and then impose spatial constraints to generate a ranked list of images in the database. The retrieval performance is evaluated on public scene text datasets as well as three large datasets, namely IIIT scene text retrieval, Sports-10K and TV series-1M, we introduce.
Type de document :
Communication dans un congrès
ICCV - IEEE International Conference on Computer Vision, Dec 2013, Sydney, Australia. IEEE, pp.3040-3047, 2013, 〈10.1109/ICCV.2013.378〉
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00875100
Contributeur : Karteek Alahari <>
Soumis le : lundi 21 octobre 2013 - 11:38:08
Dernière modification le : lundi 28 mai 2018 - 15:10:02
Document(s) archivé(s) le : mercredi 22 janvier 2014 - 04:25:44

Fichier

mishra13.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Anand Mishra, Karteek Alahari, C.V. Jawahar. Image Retrieval using Textual Cues. ICCV - IEEE International Conference on Computer Vision, Dec 2013, Sydney, Australia. IEEE, pp.3040-3047, 2013, 〈10.1109/ICCV.2013.378〉. 〈hal-00875100〉

Partager

Métriques

Consultations de la notice

939

Téléchargements de fichiers

372