An MRF Model for Binarization of Natural Scene Text

Anand Mishra (1), Karteek Alahari (2, 3), C.V. Jawahar (1)
(3) WILLOW - Models of visual object recognition and scene understanding
DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique: UMR8548
Abstract: Inspired by the success of MRF models for solving object segmentation problems, we formulate the binarization problem in this framework. We represent the pixels in a document image as random variables in an MRF, and introduce a new energy (or cost) function on these variables. Each variable takes a foreground or background label, and the quality of the binarization (or labelling) is determined by the value of the energy function. We minimize the energy function, i.e. find the optimal binarization, using an iterative graph cut scheme. Our model is robust to variations in foreground and background colours as we use a Gaussian Mixture Model in the energy function. In addition, our algorithm is efficient to compute, and adapts to a variety of document images. We show results on word images from the challenging ICDAR 2003 dataset, and compare our performance with previously reported methods. Our approach shows significant improvement in pixel level accuracy as well as OCR accuracy.
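The pipeline described in the abstract (per-pixel labels, an energy over those labels, minimization by repeated graph cuts) can be illustrated with a small sketch. This is not the authors' implementation, and several simplifications are assumptions made here for brevity: grayscale integer pixels instead of colour, a single mean per class with a squared-difference cost standing in for the Gaussian Mixture Model term, a constant penalty `smooth` for neighbouring pixels with different labels, text assumed darker than background at initialization, and a plain Edmonds-Karp max-flow as the graph cut solver. The names `energy`, `graph_cut` and `iterative_binarize` are hypothetical.

```python
from collections import deque


def energy(img, labels, fg_mean, bg_mean, smooth):
    """Energy of a labelling: data cost per pixel plus a penalty per label change."""
    h, w = len(img), len(img[0])
    total = 0
    for y in range(h):
        for x in range(w):
            mean = fg_mean if labels[y][x] == 1 else bg_mean
            total += (img[y][x] - mean) ** 2
            if x + 1 < w and labels[y][x] != labels[y][x + 1]:
                total += smooth
            if y + 1 < h and labels[y][x] != labels[y + 1][x]:
                total += smooth
    return total


def graph_cut(img, fg_mean, bg_mean, smooth):
    """Exact minimizer of `energy` via s-t min cut (1 = foreground, 0 = background)."""
    h, w = len(img), len(img[0])
    n = h * w
    s, t = n, n + 1
    cap = [dict() for _ in range(n + 2)]  # residual capacities

    def add_edge(u, v, c_uv, c_vu=0):
        cap[u][v] = cap[u].get(v, 0) + c_uv
        cap[v][u] = cap[v].get(u, 0) + c_vu

    for y in range(h):
        for x in range(w):
            p = y * w + x
            # Source side means foreground: cutting p->t pays the foreground cost,
            # cutting s->p pays the background cost.
            add_edge(s, p, (img[y][x] - bg_mean) ** 2)
            add_edge(p, t, (img[y][x] - fg_mean) ** 2)
            if x + 1 < w:
                add_edge(p, p + 1, smooth, smooth)
            if y + 1 < h:
                add_edge(p, p + w, smooth, smooth)

    while True:
        # Breadth-first search for a shortest augmenting path (Edmonds-Karp).
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            break  # `parent` now holds the source side of the minimum cut
        flow, v = float("inf"), t
        while parent[v] is not None:
            flow = min(flow, cap[parent[v]][v])
            v = parent[v]
        v = t
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= flow
            cap[v][u] += flow
            v = u

    return [[1 if y * w + x in parent else 0 for x in range(w)] for y in range(h)]


def iterative_binarize(img, smooth=2000, max_iters=5):
    """Alternate between cutting and re-estimating the two class means."""
    pixels = [v for row in img for v in row]
    fg_mean, bg_mean = min(pixels), max(pixels)  # assumption: dark text
    labels = None
    for _ in range(max_iters):
        new_labels = graph_cut(img, fg_mean, bg_mean, smooth)
        if new_labels == labels:
            break  # labelling stopped changing
        labels = new_labels
        fg = [v for row, lrow in zip(img, labels) for v, l in zip(row, lrow) if l == 1]
        bg = [v for row, lrow in zip(img, labels) for v, l in zip(row, lrow) if l == 0]
        if not fg or not bg:
            break  # one class is empty, so its mean cannot be updated
        fg_mean, bg_mean = sum(fg) // len(fg), sum(bg) // len(bg)
    return labels


word = [
    [200, 210, 205, 200],
    [200, 40, 50, 205],
    [210, 45, 120, 200],
    [205, 200, 210, 200],
]
for row in iterative_binarize(word):
    print(row)
```

Integer costs keep the max-flow arithmetic exact. In a fuller version, the per-class means would be replaced by mixture models over colour, which is the ingredient the abstract credits for robustness to foreground and background colour variation.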
Document type:
Conference paper
ICDAR - International Conference on Document Analysis and Recognition, Sep 2011, Beijing, China. IEEE, 2011, 〈http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6065267〉. 〈10.1109/ICDAR.2011.12〉

https://hal.inria.fr/hal-00817972
Contributor: Karteek Alahari
Submitted on: Thursday, October 17, 2013 - 18:59:13
Last modified on: Monday, May 28, 2018 - 15:10:02
Document(s) archived on: Saturday, January 18, 2014 - 02:45:25

File

mishra11.pdf
Files produced by the author(s)

Citation

Anand Mishra, Karteek Alahari, C.V. Jawahar. An MRF Model for Binarization of Natural Scene Text. ICDAR - International Conference on Document Analysis and Recognition, Sep 2011, Beijing, China. IEEE, 2011, 〈http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6065267〉. 〈10.1109/ICDAR.2011.12〉. 〈hal-00817972〉
