Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases - Archive ouverte HAL Access content directly
Conference Papers Year :

Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases

(1, 2) , (3, 1) , (1, 2)
1
2
3

Abstract

The document layout analysis is a complex task in the context of heterogeneous documents. It is still a challenging problem. In this paper, we present our contribution for the layout analysis competition of the international Maurdor Cam-paign. Our method is based on a grammatical description of the content of elements. It consists in iteratively finding and then removing the most structuring elements of documents. This method is based on notions of perceptive vision: a combination of points of view of the document, and the analysis of salient contents. Our description is generic enough to deal with a very wide range of heterogeneous documents. This method obtained the second place in Run 2 of Maurdor Campaign (on 1000 documents), and the best results in terms of pixel labeling for text blocs and graphic regions.
Fichier principal
Vignette du fichier
PID3215629.pdf (2.79 Mo) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01088807 , version 1 (28-11-2014)

Identifiers

  • HAL Id : hal-01088807 , version 1

Cite

Baptiste Poirriez, Aurélie Lemaitre, Bertrand Coüasnon. Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases. 14th International Conference on Frontiers in Handwriting Recognition (ICFHR-2014), Sep 2014, Crete island, Greece. ⟨hal-01088807⟩
337 View
151 Download

Share

Gmail Facebook Twitter LinkedIn More