Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation

Abstract: This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment with an approach for developing a high-level document analysis platform by composing existing building blocks from a comprehensive library of state-of-the-art algorithms. The test case for this methodology is a fully automated method for generating a browsable, hyperlinked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents have the advantage of simple browsing semantics: they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram and delivers an XML description matching the references of the legend with the indices in the image, together with a browser for interpreting the generated XML map. The complete document processing pipeline is designed within a combined scripting and compiled-library environment.
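To make the output stage of the abstract more concrete, the following is a minimal sketch of how legend references might be matched with index occurrences in the drawing and written out as XML. It is not the authors' implementation: the function name build_map, the element and attribute names (document, legend, entry, drawing, occurrence), and the sample data are all assumptions made for illustration, and the recognition steps that would produce the legend and the occurrences from the scanned image are taken as already done.

```python
import xml.etree.ElementTree as ET


def build_map(legend, occurrences):
    """Build an XML tree linking legend entries to index occurrences.

    legend      -- dict mapping an index (int) to its legend label (str)
    occurrences -- list of (index, x, y, width, height) boxes found in the drawing
    All names and the XML layout are hypothetical, chosen for this sketch only.
    """
    root = ET.Element("document")

    # One <entry> per legend reference, keyed by its index.
    legend_el = ET.SubElement(root, "legend")
    for index, label in sorted(legend.items()):
        entry = ET.SubElement(legend_el, "entry", index=str(index))
        entry.text = label

    # One <occurrence> per index found in the drawing; indices that have
    # no counterpart in the legend are skipped, since they cannot be linked.
    drawing_el = ET.SubElement(root, "drawing")
    for index, x, y, width, height in occurrences:
        if index not in legend:
            continue
        ET.SubElement(
            drawing_el, "occurrence",
            index=str(index), x=str(x), y=str(y),
            width=str(width), height=str(height),
        )
    return root


# Made-up sample data: two legend entries, four detected indices,
# one of which (7) does not appear in the legend.
sample_legend = {1: "part A", 2: "part B"}
sample_occurrences = [
    (1, 120, 80, 14, 10),
    (2, 300, 210, 14, 10),
    (1, 410, 95, 14, 10),
    (7, 5, 5, 14, 10),
]
root = build_map(sample_legend, sample_occurrences)
print(ET.tostring(root, encoding="unicode"))
```

A browser for such a map would only need to look up the occurrence elements that share the index of the selected legend entry, which is the simple browsing semantics the abstract refers to.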
Document type:
Conference paper
Fourth IAPR International Workshop on Graphics Recognition, 2001, Kingston, Ontario, Canada, 14 p, 2001
https://hal.inria.fr/inria-00100445
Contributor: Publications Loria
Submitted on: Tuesday, 26 September 2006 - 14:45:26
Last modified on: Tuesday, 25 October 2016 - 16:59:04

Identifiers

  • HAL Id: inria-00100445, version 1

Citation

Bart Lamiroy, Laurent Najman, Romain Ehrhard, Céline Louis, Franck Quélain, et al. Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation. Fourth IAPR International Workshop on Graphics Recognition, 2001, Kingston, Ontario, Canada, 14 p, 2001. <inria-00100445>
