Skip to Main content Skip to Navigation
Conference papers

Learning Texture Features for Enhancement and Segmentation of Historical Document Images

Abstract : Many challenges and open issues related to the tremendous growth in digitizing collections of cultural heritage documents have been raised, such as information retrieval in digital libraries or analyzing page content of historical books. Recently, graphic/text segmentation in historical documents has posed specific challenges due to many particularities of historical document images (e.g. noise and degradation, presence of handwriting, overlapping layouts, great variability of page layout). To cope with those challenges, a method based on learning texture features for historical document image enhancement and segmentation is proposed in this article. The proposed method is based on using the simple linear iterative clustering (SLIC) superpixels, Gabor de-scriptors and support vector machines (SVM). It has been evaluated on 100 document images which have been selected from the databases of the competitions (i.e. historical document layout analysis and historical book recognition) in the context of ICDAR conference and HIP workshop (2011 and 2013). To demonstrate the enhancement and segmentation quality, the evaluation is based on manually labeled ground truth and shows the effectiveness of the proposed method through qualitative and numerical experiments. The proposed method provides interesting results on historical document images having various page layouts and different typographical and graphical properties.
Document type :
Conference papers
Complete list of metadata

Cited literature [26 references]  Display  Hide  Download

https://hal.inria.fr/hal-01237228
Contributor : Maroua Mehri <>
Submitted on : Wednesday, December 2, 2015 - 10:35:25 PM
Last modification on : Tuesday, December 8, 2020 - 10:23:59 AM
Long-term archiving on: : Thursday, March 3, 2016 - 3:01:31 PM

File

MarouaMEHRI_HIP_2015.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01237228, version 1

Citation

Maroua Mehri, Nibal Nayef, Pierre Héroux, Petra Gomez-Krämer, Rémy Mullot. Learning Texture Features for Enhancement and Segmentation of Historical Document Images. International Workshop on Historical Document Imaging and Processing (HIP), Aug 2015, Nancy, France. pp.47-54. ⟨hal-01237228⟩

Share

Metrics

Record views

311

Files downloads

591