XML data representation in Document Image Analysis

Abdel Belaid 1 Ingrid Falk 2 Yves Rangoni 1
1 READ - READ
LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
2 TALARIS - Natural Language Processing: representation, inference and semantics
Inria Nancy - Grand Est, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents the XML-based formats ALTO, TEI, METS used for Digital Libraries and their interest for data representation in a Document Image Analysis and Recognition (DIAR) process. In the first part we briefly present these formats with focus on their adequacy for structural representation and modeling of DIAR data. The second part shows how these formats can be used in a reverse engineering process. Their implementation as a data representation framework will be shown.
Type de document :
Communication dans un congrès
Flavio Bortolozzi and Robert Sabourin. 9th International Conference on Document Analysis and Recognition - ICDAR'07, Sep 2007, Curitiba, Brazil. IEEE Computer Society, pp.78-82, 2007
Liste complète des métadonnées

https://hal.inria.fr/inria-00176680
Contributeur : Yves Rangoni <>
Soumis le : jeudi 4 octobre 2007 - 13:25:51
Dernière modification le : jeudi 11 janvier 2018 - 06:21:35

Identifiants

  • HAL Id : inria-00176680, version 1

Collections

Citation

Abdel Belaid, Ingrid Falk, Yves Rangoni. XML data representation in Document Image Analysis. Flavio Bortolozzi and Robert Sabourin. 9th International Conference on Document Analysis and Recognition - ICDAR'07, Sep 2007, Curitiba, Brazil. IEEE Computer Society, pp.78-82, 2007. 〈inria-00176680〉

Partager

Métriques

Consultations de la notice

242