Scene Text Recognition using Higher Order Language Priors

Anand Mishra 1 Karteek Alahari 2, 3 C.V. Jawahar 1
3 WILLOW - Models of visual object recognition and scene understanding
CNRS - Centre National de la Recherche Scientifique : UMR8548, Inria Paris-Rocquencourt, DI-ENS - Département d'informatique de l'École normale supérieure
Abstract : The problem of recognizing text in images taken in the wild has gained significant attention from the computer vision community in recent years. Contrary to recognition of printed documents, recognizing scene text is a challenging problem. We focus on the problem of recognizing text extracted from natural scene images and the web. Significant attempts have been made to address this problem in the recent past. However, many of these works benefit from the availability of strong context, which naturally limits their applicability. In this work we present a framework that uses a higher order prior computed from an English dictionary to recognize a word, which may or may not be a part of the dictionary. We show experimental results on publicly available datasets. Furthermore, we introduce a large challenging word dataset with five thousand words to evaluate various steps of our method exhaustively. The main contributions of this work are: (1) We present a framework, which incorporates higher order statistical language models to recognize words in an unconstrained manner (i.e. we overcome the need for restricted word lists, and instead use an English dictionary to compute the priors). (2) We achieve significant improvement (more than 20%) in word recognition accuracies without using a restricted word list. (3) We introduce a large word recognition dataset (atleast 5 times larger than other public datasets) with character level annotation and benchmark it.
Type de document :
Communication dans un congrès
BMVC - British Machine Vision Conference, Sep 2012, Surrey, United Kingdom. BMVA, 2012, 〈http://www.bmva.org/bmvc/2012/BMVC/paper127/index.html〉. 〈10.5244/C.26.127〉
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00818183
Contributeur : Karteek Alahari <>
Soumis le : jeudi 17 octobre 2013 - 18:55:37
Dernière modification le : lundi 28 mai 2018 - 15:10:02
Document(s) archivé(s) le : samedi 18 janvier 2014 - 02:55:24

Fichier

mishra12a.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Anand Mishra, Karteek Alahari, C.V. Jawahar. Scene Text Recognition using Higher Order Language Priors. BMVC - British Machine Vision Conference, Sep 2012, Surrey, United Kingdom. BMVA, 2012, 〈http://www.bmva.org/bmvc/2012/BMVC/paper127/index.html〉. 〈10.5244/C.26.127〉. 〈hal-00818183〉

Partager

Métriques

Consultations de la notice

758

Téléchargements de fichiers

458