Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases

Abstract : The document layout analysis is a complex task in the context of heterogeneous documents. It is still a challenging problem. In this paper, we present our contribution for the layout analysis competition of the international Maurdor Cam-paign. Our method is based on a grammatical description of the content of elements. It consists in iteratively finding and then removing the most structuring elements of documents. This method is based on notions of perceptive vision: a combination of points of view of the document, and the analysis of salient contents. Our description is generic enough to deal with a very wide range of heterogeneous documents. This method obtained the second place in Run 2 of Maurdor Campaign (on 1000 documents), and the best results in terms of pixel labeling for text blocs and graphic regions.
Type de document :
Communication dans un congrès
14th International Conference on Frontiers in Handwriting Recognition (ICFHR-2014), Sep 2014, Crete island, Greece
Liste complète des métadonnées

Littérature citée [8 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01088807
Contributeur : Aurélie Lemaitre <>
Soumis le : vendredi 28 novembre 2014 - 16:58:03
Dernière modification le : jeudi 5 avril 2018 - 12:30:15
Document(s) archivé(s) le : vendredi 14 avril 2017 - 23:10:15

Fichier

PID3215629.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01088807, version 1

Citation

Baptiste Poirriez, Aurélie Lemaitre, Bertrand Coüasnon. Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases. 14th International Conference on Frontiers in Handwriting Recognition (ICFHR-2014), Sep 2014, Crete island, Greece. 〈hal-01088807〉

Partager

Métriques

Consultations de la notice

373

Téléchargements de fichiers

131