Machine Print Filter for Handwriting Analysis

Abstract : Existing document decomposition models fail in well separating the zone areas of printed text and handwritten text when they are close or even touching each other. This paper presents a simple and robust algorithm to filter out the printed content in a mixed document. Following the traditional bottom-up approach, the printed text candidates are extracted and detected in the connected component level. Then the relative spatial relation and windowbased filter help providing useful information to decide the components to be removed finally. Results with manually cropped signature blocks that contain extraneous printed text show that over 85% of printed text components are removed while preserving handwritten content.
Type de document :
Communication dans un congrès
Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006


https://hal.inria.fr/inria-00112683
Contributeur : Anne Jaigu <>
Soumis le : jeudi 9 novembre 2006 - 15:23:51
Dernière modification le : jeudi 9 novembre 2006 - 16:51:24
Document(s) archivé(s) le : mardi 6 avril 2010 - 22:02:43

Identifiants

  • HAL Id : inria-00112683, version 1

Collections

Citation

Siyuan Chen, Sargur Srihari. Machine Print Filter for Handwriting Analysis. Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006. <inria-00112683>

Partager

Métriques

Consultations de
la notice

82

Téléchargements du document

98