Discrimination Between Digits and Outliers in Handwritten Documents Applied to the Extraction of Numerical Fields - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

Discrimination Between Digits and Outliers in Handwritten Documents Applied to the Extraction of Numerical Fields

Laurent Heutte
Thierry Paquet

Résumé

In this article, we propose a numerical field extraction system from unconstrained handwritten documents. The system is based on a segmentation driven by recognition stage followed by a syntactical analysis which detects the sequences that may compose a numerical field. We focus here on the design of a digit classifier embedded in the segmentation/ recognition process able to discriminate digits from outliers such as words, fragment of words, noise, etc. For that, we have developed a light classifier used as prior to a standard digit classifier in order to reject “obvious outliers”. Several classifiers have been compared in terms of ROC curve and processing time.
Fichier principal
Vignette du fichier
cr1004245762281.pdf (403.25 Ko) Télécharger le fichier
Loading...

Dates et versions

inria-00103696 , version 1 (05-10-2006)

Identifiants

  • HAL Id : inria-00103696 , version 1

Citer

Clément Chatelain, Laurent Heutte, Thierry Paquet. Discrimination Between Digits and Outliers in Handwritten Documents Applied to the Extraction of Numerical Fields. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00103696⟩
56 Consultations
117 Téléchargements

Partager

Gmail Facebook X LinkedIn More