Scene Text Recognition and Retrieval for Large Lexicons

Udit Roy (1), Anand Mishra (1), Karteek Alahari (2), C.V. Jawahar (1)
(2) LEAR - Learning and recognition in vision, Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : In this paper we propose a framework for recognition and retrieval tasks in the context of scene text images. In contrast to many recent works, we focus on the case where an image-specific list of words (the small lexicon setting) is unavailable. We present a conditional random field model defined on potential character locations and the interactions between them. Observing that the interaction potentials computed in the large lexicon setting are less effective than those computed from a small lexicon, we propose an iterative method that alternates between finding the most likely solution and refining the interaction potentials. We evaluate our method on public datasets and show that it improves over baseline and state-of-the-art approaches. For example, on the IIIT-5K word dataset, with a large lexicon containing 0.5 million words, we obtain nearly 15% improvement over baseline methods in recognition accuracy and in precision for our retrieval task.
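
The alternation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a simple chain over pre-segmented character slots instead of a CRF over candidate character locations. It also assumes that the pairwise costs are smoothed bigram frequencies from the lexicon, and that "refining" means shrinking the lexicon to the words closest in edit distance to the current guess. All names (`bigram_cost`, `viterbi`, `recognise`, `peaked_unary`) and the parameters `rounds`, `keep` and `smoothing` are hypothetical.

```python
from collections import Counter
from math import log

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

def bigram_cost(lexicon, smoothing=0.1):
    """Pairwise cost: negative log of the smoothed bigram frequency in the lexicon."""
    counts = Counter(w[i:i + 2] for w in lexicon for i in range(len(w) - 1))
    total = sum(counts.values()) + smoothing * len(ALPHABET) ** 2
    return lambda a, b: -log((counts[a + b] + smoothing) / total)

def viterbi(unary, pair_cost):
    """Minimum-cost labelling of a chain; unary[i][c] is the cost of char c at slot i."""
    best = {c: (unary[0][c], c) for c in ALPHABET}
    for scores in unary[1:]:
        step = {}
        for c in ALPHABET:
            cost, path = min((best[p][0] + pair_cost(p, c), best[p][1]) for p in ALPHABET)
            step[c] = (cost + scores[c], path + c)
        best = step
    return min(best.values())[1]

def edit_distance(a, b):
    """Levenshtein distance computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def recognise(unary, lexicon, rounds=3, keep=0.5):
    """Alternate between inference and re-estimating pairwise costs on a smaller lexicon."""
    words = list(lexicon)
    guess = ""
    for _ in range(rounds):
        guess = viterbi(unary, bigram_cost(words))          # most likely solution
        words.sort(key=lambda w: edit_distance(w, guess))   # words nearest the guess first
        words = words[:max(1, int(len(words) * keep))]      # assumed refinement step
    return guess

def peaked_unary(word, gap=100.0):
    """Toy stand-in for a character classifier: cost 0 for the 'seen' char, gap otherwise."""
    return [{c: (0.0 if c == ch else gap) for c in ALPHABET} for ch in word]

if __name__ == "__main__":
    unary = peaked_unary("grcen")   # the classifier misreads the third character as 'c'
    unary[2]["e"] = 1.0             # ... but 'e' is a close second
    print(viterbi(unary, lambda a, b: 0.0))             # prints "grcen"
    print(recognise(unary, ["green", "great", "tree"]))  # prints "green"
```

In this toy case the unary costs alone favour the misread "grcen", while the lexicon-derived pairwise costs pull the solution to "green". Shrinking the lexicon around the current guess makes the bigram statistics more specific to the word in the image, which is the intuition behind refining the interaction potentials when no small image-specific lexicon is given.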
Document type :
Conference papers

https://hal.inria.fr/hal-01088739
Contributor : Karteek Alahari
Submitted on : Friday, November 28, 2014 - 3:26:13 PM
Last modification on : Sunday, March 10, 2019 - 1:30:23 PM
Long-term archiving on : Friday, April 14, 2017 - 11:03:30 PM

File

roy14.pdf
Files produced by the author(s)

Citation

Udit Roy, Anand Mishra, Karteek Alahari, C.V. Jawahar. Scene Text Recognition and Retrieval for Large Lexicons. ACCV - Asian Conference on Computer Vision, Nov 2014, Singapore, Singapore. pp.494-508, ⟨10.1007/978-3-319-16865-4_32⟩. ⟨hal-01088739⟩