Conference paper, Year: 2015

A structural method based on texture for ancient document image analysis

Abstract

My research work proposes a texture-based structural signature for characterizing and categorizing digitized historical book pages. The signature assumes no a priori knowledge of page layout or content, so it applies to a wide variety of ancient books. It combines low-level features (e.g. texture) that characterize the different page components (i.e. different text fonts or graphic regions) with structural information describing the page layout, and thus provides a rich, holistic description of the layout and content of the analyzed book pages. More precisely, the signature-based characterization approach consists of two stages. The first stage automatically extracts homogeneous regions. The second stage builds a graph-based page signature from the extracted homogeneous regions, reflecting the page's layout and content. This signature enables numerous applications for effectively managing a corpus or collection of books (e.g. information retrieval in digital libraries according to several criteria, or page categorization). To illustrate the effectiveness of the proposed page signature, a detailed experimental evaluation assesses two possible categorization applications: unsupervised page classification and page stream segmentation. In addition, the different steps of the proposed approach have been evaluated on a large variety of historical document images.
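The two-stage idea in the abstract can be illustrated with a toy sketch. This is not the authors' method: it assumes stage one has already produced homogeneous regions as bounding boxes with a texture-cluster label, and it swaps in a simple proximity rule to build the graph. The names Region, gap, page_signature, label_pair_histogram and the max_gap threshold are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Region:
    """A homogeneous region: bounding box plus a texture-cluster label (assumed given)."""
    x0: int
    y0: int
    x1: int
    y1: int
    label: int  # hypothetical texture-cluster id from stage one


def gap(a, b):
    """Euclidean gap between two axis-aligned boxes (0 if they touch or overlap)."""
    dx = max(a.x0 - b.x1, b.x0 - a.x1, 0)
    dy = max(a.y0 - b.y1, b.y0 - a.y1, 0)
    return (dx * dx + dy * dy) ** 0.5


def page_signature(regions, max_gap=12):
    """Toy graph signature: nodes carry texture labels, edges link nearby regions."""
    nodes = {i: r.label for i, r in enumerate(regions)}
    edges = {
        (i, j)
        for (i, a), (j, b) in combinations(enumerate(regions), 2)
        if gap(a, b) <= max_gap  # illustrative adjacency rule, an assumption
    }
    return nodes, edges


def label_pair_histogram(nodes, edges):
    """Order-independent summary: number of edges per unordered pair of labels."""
    hist = {}
    for i, j in edges:
        key = tuple(sorted((nodes[i], nodes[j])))
        hist[key] = hist.get(key, 0) + 1
    return hist


# Made-up page: a title band, a text column, and a graphic beside the column.
page = [
    Region(0, 0, 100, 20, label=0),     # title
    Region(0, 30, 100, 200, label=1),   # body text
    Region(110, 30, 200, 200, label=2), # graphic
]
nodes, edges = page_signature(page)
summary = label_pair_histogram(nodes, edges)
```

A summary such as the label-pair histogram is one simple way two pages could be compared for categorization; a real system would likely use a richer graph comparison.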
Main file
MarouaMEHRI_DoctoralConsortium_ICDAR_2015.pdf (103.87 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01250512, version 1 (04-01-2016)

Identifiers

  • HAL Id: hal-01250512, version 1

Cite

Maroua Mehri, Pierre Héroux, Petra Gomez-Krämer, Rémy Mullot. A structural method based on texture for ancient document image analysis. ICDAR - Doctoral Consortium, Aug 2015, Nancy, France. ⟨hal-01250512⟩
