Skip to Main content Skip to Navigation
Conference papers

XML data representation in Document Image Analysis

Abdel Belaid 1 Ingrid Falk 2 Yves Rangoni 1
1 READ - READ
LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
2 TALARIS - Natural Language Processing: representation, inference and semantics
Inria Nancy - Grand Est, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents the XML-based formats ALTO, TEI, METS used for Digital Libraries and their interest for data representation in a Document Image Analysis and Recognition (DIAR) process. In the first part we briefly present these formats with focus on their adequacy for structural representation and modeling of DIAR data. The second part shows how these formats can be used in a reverse engineering process. Their implementation as a data representation framework will be shown.
Complete list of metadatas

https://hal.inria.fr/inria-00176680
Contributor : Yves Rangoni <>
Submitted on : Thursday, October 4, 2007 - 1:25:51 PM
Last modification on : Thursday, January 11, 2018 - 6:21:35 AM

Identifiers

  • HAL Id : inria-00176680, version 1

Collections

Citation

Abdel Belaid, Ingrid Falk, Yves Rangoni. XML data representation in Document Image Analysis. 9th International Conference on Document Analysis and Recognition - ICDAR'07, IAPR, Sep 2007, Curitiba, Brazil. pp.78-82. ⟨inria-00176680⟩

Share

Metrics

Record views

275