A structural method based on texture for ancient document image analysis

Abstract : A structural signature based on texture for the characterization and categorization of digitized historical book pages is proposed in my research work. The proposed signature does not assume a priori knowledge regarding page layout and content, and hence, it is applicable to a large variety of ancient books. By integrating varying low-level features (e.g. texture) characterizing the different page components (i.e. different text fonts or graphic regions) on the one hand, and structural information describing the page layout on the other hand, the proposed signature provides a rich and holistic description of the layout and content of the analyzed book pages. More precisely, the signature-based characterization approach consists of two stages. The first stage is extracting automatically homogeneous regions. Then, the second one is proposing a graph-based page signature, which is based on the extracted homogeneous regions, reflecting its layout and content. This signature ensures the implementation of numerous applications for managing effectively a corpus or collections of books (e.g. information retrieval in digital libraries according to several criteria, or page categorization). To illustrate the effectiveness of the proposed page signature, a detailed experimental evaluation has been conducted in this work for assessing two possible categorization applications, unsupervised page classification and page stream segmentation. In addition, the different steps of the proposed approach have been evaluated on a large variety of historical document images.
Type de document :
Communication dans un congrès
ICDAR - Doctoral Consortium, Aug 2015, Nancy, France. 2015
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01250512
Contributeur : Maroua Mehri <>
Soumis le : lundi 4 janvier 2016 - 22:23:38
Dernière modification le : mardi 5 juin 2018 - 10:14:25

Fichier

MarouaMEHRI_DoctoralConsortium...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01250512, version 1

Citation

Mehri Maroua, Pierre Héroux, Petra Gomez-Krämer, Rémy Mullot. A structural method based on texture for ancient document image analysis. ICDAR - Doctoral Consortium, Aug 2015, Nancy, France. 2015. 〈hal-01250512〉

Partager

Métriques

Consultations de la notice

108

Téléchargements de fichiers

108