Skip to Main content Skip to Navigation
Conference papers

Word-wise Hand-written Script Separation for Indian Postal automation

Abstract : In a multi-lingual multi-script country like India, a postal document may contain words of two or more scripts. For recognition of this document it is necessary to separate different scripts from the document. In this paper, an automatic scheme for word-wise identification of hand-written Roman and Oriya scripts is proposed for Indian postal automation. In the proposed scheme, at first, document skew is corrected. Next, using a piecewise projection method the document is segmented into lines and then lines into words. Finally, using different features like, water reservoir concept based features, fractal dimension based features, topological features, scripts characteristics based features etc., a Neural Network (NN) classifier is used for word-wise script identification. For experiment we consider 2500 words and overall accuracy of 97.69% is obtained from the proposed identification scheme.
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download

https://hal.inria.fr/inria-00104358
Contributor : Anne Jaigu <>
Submitted on : Friday, October 6, 2006 - 1:07:21 PM
Last modification on : Saturday, July 28, 2018 - 2:54:01 PM
Long-term archiving on: : Tuesday, April 6, 2010 - 6:49:10 PM

Identifiers

  • HAL Id : inria-00104358, version 1

Collections

Citation

K. Roy, U. Pal. Word-wise Hand-written Script Separation for Indian Postal automation. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00104358⟩

Share

Metrics

Record views

166

Files downloads

297