Scene Text Recognition and Retrieval for Large Lexicons

Udit Roy 1 Anand Mishra 1 Karteek Alahari 2 C.V. Jawahar 1
2 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : In this paper we propose a framework for recognition and retrieval tasks in the context of scene text images. In contrast to many of the recent works, we focus on the case where an image-specific list of words, known as the small lexicon setting, is unavailable. We present a conditional random field model defined on potential character locations and the interactions between them. Observing that the interaction potentials computed in the large lexicon setting are less effective than in the case of a small lexicon, we propose an iterative method, which alternates between finding the most likely solution and refining the interaction po-tentials. We evaluate our method on public datasets and show that it improves over baseline and state-of-the-art approaches. For example, we obtain nearly 15% improvement in recognition accuracy and precision for our retrieval task over baseline methods on the IIIT-5K word dataset, with a large lexicon containing 0.5 million words.
Document type :
Conference papers
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download
Contributor : Karteek Alahari <>
Submitted on : Friday, November 28, 2014 - 3:26:13 PM
Last modification on : Sunday, March 10, 2019 - 1:30:23 PM
Long-term archiving on : Friday, April 14, 2017 - 11:03:30 PM


Files produced by the author(s)




Udit Roy, Anand Mishra, Karteek Alahari, C.V. Jawahar. Scene Text Recognition and Retrieval for Large Lexicons. ACCV - Asian Conference on Computer Vision, Nov 2014, Singapore, Singapore. pp.494-508, ⟨10.1007/978-3-319-16865-4_32⟩. ⟨hal-01088739⟩



Record views


Files downloads