inria-00176680, version 1
XML data representation in Document Image Analysis
Abdel Belaid a, 1Ingrid Falk 2Yves Rangoni
a, 1
9th International Conference on Document Analysis and Recognition - ICDAR'07 (2007) 78-82
Résumé : This paper presents the XML-based formats ALTO, TEI, METS used for Digital Libraries and their interest for data representation in a Document Image Analysis and Recognition (DIAR) process. In the first part we briefly present these formats with focus on their adequacy for structural representation and modeling of DIAR data. The second part shows how these formats can be used in a reverse engineering process. Their implementation as a data representation framework will be shown.
- a – Université Nancy II
- 1 : READ (LORIA)
- INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- 2 : TALARIS (INRIA Lorraine - LORIA)
- CNRS : UMR7503 – INRIA – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- Domaine : Informatique/Vision par ordinateur et reconnaissance de formes
Informatique/Traitement du texte et du document - Mots-clés : XML – TEI – ALTO – METS – Document Image Analysis and Recognition – XSLT – Reverse Engineering – Document Class Model
- inria-00176680, version 1
- http://hal.inria.fr/inria-00176680
- oai:hal.inria.fr:inria-00176680
- Contributeur : Yves Rangoni
- Soumis le : Jeudi 4 Octobre 2007, 13:25:51
- Dernière modification le : Mardi 31 Mai 2011, 10:31:29






Exporter