Word-wise Hand-written Script Separation for Indian Postal automation

Abstract: In a multi-lingual, multi-script country like India, a postal document may contain words in two or more scripts. To recognize such a document, the different scripts must first be separated. This paper proposes an automatic scheme for word-wise identification of hand-written Roman and Oriya scripts for Indian postal automation. In the proposed scheme, document skew is corrected first. Next, a piecewise projection method segments the document into lines and then the lines into words. Finally, a Neural Network (NN) classifier performs word-wise script identification using features such as water reservoir concept based features, fractal dimension based features, topological features, and script characteristics based features, among others. In experiments on 2500 words, the proposed identification scheme achieved an overall accuracy of 97.69%.
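The segmentation step in the abstract can be illustrated with a minimal sketch. The abstract does not describe the details of the paper's piecewise projection method, so the code below uses a plain, whole-page projection profile instead, applied to an already deskewed binary image. All names (`runs_of_ink`, `segment_lines`, `segment_words`) and the `min_gap` threshold are hypothetical choices for illustration, not taken from the paper.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = ink pixel, 0 = background


def runs_of_ink(profile: List[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) pairs (end exclusive) of non-empty runs in a profile.

    Runs separated by fewer than `min_gap` empty positions are merged.
    """
    runs: List[Tuple[int, int]] = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i                      # a run of ink begins
        elif value == 0 and start is not None:
            runs.append((start, i))        # the run ends at an empty position
            start = None
    if start is not None:
        runs.append((start, len(profile)))

    merged: List[Tuple[int, int]] = []
    for run in runs:
        if merged and run[0] - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], run[1])  # gap too small: same unit
        else:
            merged.append(run)
    return merged


def segment_lines(image: Image) -> List[Tuple[int, int]]:
    """Split a page into text lines using the horizontal projection profile."""
    row_profile = [sum(row) for row in image]
    return runs_of_ink(row_profile)


def segment_words(image: Image, line: Tuple[int, int],
                  min_gap: int = 2) -> List[Tuple[int, int]]:
    """Split one text line into words using the vertical projection profile.

    `min_gap` (an assumed threshold) is the smallest column gap treated as a
    word boundary; narrower gaps are taken to separate characters.
    """
    top, bottom = line
    width = len(image[0]) if image else 0
    col_profile = [sum(image[r][c] for r in range(top, bottom))
                   for c in range(width)]
    return runs_of_ink(col_profile, min_gap=min_gap)


# Toy page: two text lines; the first holds two words split by a 2-column gap.
page = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 1, 1],
    [1, 1, 0, 1, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0, 0, 0],
]
lines = segment_lines(page)
words_in_first_line = segment_words(page, lines[0])
```

On real hand-written pages, lines often overlap or undulate, which is presumably why the paper works piecewise; a whole-page profile like this one would merge such lines.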
Document type:
Conference paper
Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006
Cited literature: 16 references

https://hal.inria.fr/inria-00104358
Contributor: Anne Jaigu
Submitted on: Friday, 6 October 2006 - 13:07:21
Last modified on: Friday, 6 October 2006 - 13:23:40
Document(s) archived on: Tuesday, 6 April 2010 - 18:49:10

Identifiers

  • HAL Id : inria-00104358, version 1

Citation

K. Roy, U. Pal. Word-wise Hand-written Script Separation for Indian Postal automation. Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006. 〈inria-00104358〉
