Discrimination Between Digits and Outliers in Handwritten Documents Applied to the Extraction of Numerical Fields

Abstract : In this article, we propose a numerical field extraction system from unconstrained handwritten documents. The system is based on a segmentation driven by recognition stage followed by a syntactical analysis which detects the sequences that may compose a numerical field. We focus here on the design of a digit classifier embedded in the segmentation/ recognition process able to discriminate digits from outliers such as words, fragment of words, noise, etc. For that, we have developed a light classifier used as prior to a standard digit classifier in order to reject “obvious outliers”. Several classifiers have been compared in terms of ROC curve and processing time.
Type de document :
Communication dans un congrès
Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00103696
Contributeur : Anne Jaigu <>
Soumis le : jeudi 5 octobre 2006 - 10:18:17
Dernière modification le : jeudi 11 janvier 2018 - 06:19:28
Document(s) archivé(s) le : mardi 6 avril 2010 - 18:19:35

Identifiants

  • HAL Id : inria-00103696, version 1

Citation

Clément Chatelain, Laurent Heutte, Thierry Paquet. Discrimination Between Digits and Outliers in Handwritten Documents Applied to the Extraction of Numerical Fields. Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006. 〈inria-00103696〉

Partager

Métriques

Consultations de la notice

72

Téléchargements de fichiers

140