Image Retrieval using Textual Cues

Anand Mishra; Karteek Alahari; C.V. Jawahar

doi:10.1109/ICCV.2013.378

Communication Dans Un Congrès Année : 2013

Image Retrieval using Textual Cues

(1) , (2, 3, 4) , (1)

1
2
3
4

Anand Mishra

Fonction : Auteur

Center for Visual Information Technology [Hyderabad]

Karteek Alahari

Fonction : Auteur
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Models of visual object recognition and scene understanding

Laboratoire d'informatique de l'école normale supérieure

Learning and recognition in vision

C.V. Jawahar

Fonction : Auteur

Center for Visual Information Technology [Hyderabad]

Résumé

We present an approach for the text-to-image retrieval problem based on textual content present in images. Given the recent developments in understanding text in images, an appealing approach to address this problem is to localize and recognize the text, and then query the database, as in a text retrieval problem. We show that such an approach, despite being based on state-of-the-art methods, is insufficient, and propose a method, where we do not rely on an exact localization and recognition pipeline. We take a query-driven search approach, where we find approximate locations of characters in the text query, and then impose spatial constraints to generate a ranked list of images in the database. The retrieval performance is evaluated on public scene text datasets as well as three large datasets, namely IIIT scene text retrieval, Sports-10K and TV series-1M, we introduce.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

mishra13.pdf (3.26 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Karteek Alahari : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00875100

Soumis le : lundi 21 octobre 2013-11:38:08

Dernière modification le : jeudi 4 avril 2024-21:13:11

Archivage à long terme le : mercredi 22 janvier 2014-04:25:44

Dates et versions

hal-00875100 , version 1 (21-10-2013)

Identifiants

HAL Id : hal-00875100 , version 1
DOI : 10.1109/ICCV.2013.378

Citer

Anand Mishra, Karteek Alahari, C.V. Jawahar. Image Retrieval using Textual Cues. ICCV - IEEE International Conference on Computer Vision, Dec 2013, Sydney, Australia. pp.3040-3047, ⟨10.1109/ICCV.2013.378⟩. ⟨hal-00875100⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UGA CNRS INRIA LJK LJK_GI LJK_GI_LEAR QUAERO INRIA2 PSL

646 Consultations

520 Téléchargements

Image Retrieval using Textual Cues

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager