Model-Based Annotation of Online Handwritten Datasets - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

Model-Based Annotation of Online Handwritten Datasets

Résumé

Annotated datasets of handwriting are a prerequisite to attempt a variety of problems such as building recognizers, developing writer identication algorithms, etc. However, the annotation of large datasets is a tedious and expensive process, especially at the character or stroke level. In this paper we propose a novel, automated method for annotation at the character level, given a parallel corpus of online handwritten data and the corresponding text. The method employs a model-based handwriting synthesis unit to map the two corpora to the same space and the annotation is propagated to the word level and then to the individual characters using elastic matching. The initial results of annotation are used to improve the handwriting synthesis model for the user under consideration, which in turn renes the annotation. The method can take care of errors in the handwriting such as spurious and missing strokes or characters. The output is stored in the UPXInkML format.
Fichier principal
Vignette du fichier
cr112642680558.pdf (356.68 Ko) Télécharger le fichier
Loading...

Dates et versions

inria-00105158 , version 1 (10-10-2006)

Identifiants

  • HAL Id : inria-00105158 , version 1

Citer

Anand Kumar, A. Balasubramanian, Anoop Namboodiri, C.V. Jawahar. Model-Based Annotation of Online Handwritten Datasets. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00105158⟩

Collections

IWFHR10
166 Consultations
181 Téléchargements

Partager

Gmail Facebook X LinkedIn More