Conference paper, Year: 2006

Word-wise Hand-written Script Separation for Indian Postal automation

Abstract

In a multilingual, multi-script country like India, a postal document may contain words in two or more scripts. To recognize such a document, the different scripts must first be separated. This paper proposes an automatic scheme for word-wise identification of hand-written Roman and Oriya scripts for Indian postal automation. In the proposed scheme, document skew is corrected first. Next, a piecewise projection method segments the document into lines, and the lines into words. Finally, a Neural Network (NN) classifier performs word-wise script identification using features such as water-reservoir-concept-based features, fractal-dimension-based features, topological features, and script-characteristic-based features. In the experiments, 2500 words were considered, and the proposed identification scheme achieved an overall accuracy of 97.69%.
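The segmentation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a binarized, already deskewed image stored as a list of rows (1 = ink, 0 = background), and the function names, the `stripe_width` parameter, and the `min_gap` threshold are all hypothetical choices made for illustration. The idea shown is that projection profiles are computed per vertical stripe rather than across the full page width, and that words are split at sufficiently wide empty column gaps.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = ink, 0 = background (assumed format)


def row_profile(image: Image, x0: int, x1: int) -> List[int]:
    """Count ink pixels in each row, restricted to columns x0..x1-1."""
    return [sum(row[x0:x1]) for row in image]


def ink_runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) ranges of consecutive non-zero entries; end is exclusive."""
    runs = []
    start = None
    for i, count in enumerate(profile):
        if count > 0 and start is None:
            start = i
        elif count == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def piecewise_line_runs(image: Image, stripe_width: int) -> List[List[Tuple[int, int]]]:
    """For each vertical stripe, return the row ranges that contain ink."""
    width = len(image[0]) if image else 0
    return [
        ink_runs(row_profile(image, x0, min(x0 + stripe_width, width)))
        for x0 in range(0, width, stripe_width)
    ]


def words_in_line(image: Image, top: int, bottom: int, min_gap: int) -> List[Tuple[int, int]]:
    """Split the text line in rows top..bottom-1 into words.

    Column ranges separated by fewer than min_gap empty columns are
    merged into one word (hypothetical threshold, not from the paper).
    """
    width = len(image[0])
    col_profile = [sum(image[y][x] for y in range(top, bottom)) for x in range(width)]
    words: List[Tuple[int, int]] = []
    for start, end in ink_runs(col_profile):
        if words and start - words[-1][1] < min_gap:
            words[-1] = (words[-1][0], end)  # gap too small: same word
        else:
            words.append((start, end))
    return words
```

In this sketch the per-stripe row ranges would still have to be linked across neighbouring stripes to form complete text lines, a step the abstract does not detail and which is therefore left out here.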
Main file
cr1065210301587.pdf (321.95 KB)

Dates and versions

inria-00104358, version 1 (06-10-2006)

Identifiers

  • HAL Id: inria-00104358, version 1

Cite

K. Roy, U. Pal. Word-wise Hand-written Script Separation for Indian Postal automation. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00104358⟩

Collections

IWFHR10
