Scene Text Recognition and Retrieval for Large Lexicons

Udit Roy; Anand Mishra; Karteek Alahari; C.V. Jawahar

doi:10.1007/978-3-319-16865-4_32

Communication Dans Un Congrès Année : 2015

Scene Text Recognition and Retrieval for Large Lexicons

(1) , (1) , (2) , (1)

1
2

Udit Roy

Fonction : Auteur

Center for Visual Information Technology [Hyderabad]

Anand Mishra

Fonction : Auteur

Center for Visual Information Technology [Hyderabad]

Karteek Alahari

Fonction : Auteur
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Learning and recognition in vision

C.V. Jawahar

Fonction : Auteur
PersonId : 835846

Center for Visual Information Technology [Hyderabad]

Résumé

In this paper we propose a framework for recognition and retrieval tasks in the context of scene text images. In contrast to many of the recent works, we focus on the case where an image-specific list of words, known as the small lexicon setting, is unavailable. We present a conditional random field model defined on potential character locations and the interactions between them. Observing that the interaction potentials computed in the large lexicon setting are less effective than in the case of a small lexicon, we propose an iterative method, which alternates between finding the most likely solution and refining the interaction po-tentials. We evaluate our method on public datasets and show that it improves over baseline and state-of-the-art approaches. For example, we obtain nearly 15% improvement in recognition accuracy and precision for our retrieval task over baseline methods on the IIIT-5K word dataset, with a large lexicon containing 0.5 million words.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

roy14.pdf (637.94 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Karteek Alahari : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01088739

Soumis le : vendredi 28 novembre 2014-15:26:13

Dernière modification le : vendredi 5 avril 2024-03:08:20

Archivage à long terme le : vendredi 14 avril 2017-23:03:30

Dates et versions

hal-01088739 , version 1 (28-11-2014)

Identifiants

HAL Id : hal-01088739 , version 1
DOI : 10.1007/978-3-319-16865-4_32

Citer

Udit Roy, Anand Mishra, Karteek Alahari, C.V. Jawahar. Scene Text Recognition and Retrieval for Large Lexicons. ACCV - Asian Conference on Computer Vision, Nov 2014, Singapore, Singapore. pp.494-508, ⟨10.1007/978-3-319-16865-4_32⟩. ⟨hal-01088739⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIAMA LJK LJK_GI LJK_GI_LEAR INRIA2

284 Consultations

478 Téléchargements

Scene Text Recognition and Retrieval for Large Lexicons

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager