Pattern-Based Approach to Table Extraction

Santosh K.C. 1,*, Abdel Belaïd 1
* Corresponding author
1 READ - Recognition of writing and analysis of documents
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: In this paper, we present a client-driven approach to automatically extracting the information content of tables in document images. We start with a graph-based representation of a set of key fields selected by the client and perform graph mining on a document to learn them and produce a model. Such models are intended to extract information content when the client is no longer present. To avoid the NP-hard general graph-matching problem, our matching is based on relation assignment, which checks whether pairs of nodes are semantically identical. We have validated the concept on a real-world industrial problem.
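The idea of matching by relation assignment can be illustrated with a minimal sketch. This record gives no details of the authors' algorithm, so everything below is an assumption made for illustration: the bounding-box format `(x, y, w, h)`, the four coarse spatial relations, the field labels, and the helper names `relation`, `build_graph`, and `matches` are all hypothetical. The sketch only shows why comparing relations between labelled node pairs avoids a general subgraph-isomorphism search: each labelled pair is checked once.

```python
from itertools import permutations


def relation(a, b):
    """Coarse spatial relation of box b relative to box a.

    Boxes are (x, y, w, h) tuples; this format is an assumption.
    """
    ax, ay = a[0] + a[2] / 2, a[1] + a[3] / 2  # centre of a
    bx, by = b[0] + b[2] / 2, b[1] + b[3] / 2  # centre of b
    dx, dy = bx - ax, by - ay
    if abs(dx) >= abs(dy):
        return "right" if dx > 0 else "left"
    return "below" if dy > 0 else "above"


def build_graph(fields):
    """Map every ordered pair of key-field labels to its spatial relation."""
    return {
        (a, b): relation(fields[a], fields[b])
        for a, b in permutations(fields, 2)
    }


def matches(model, candidate):
    """True when every relation in the model also holds in the candidate."""
    return all(candidate.get(pair) == rel for pair, rel in model.items())


# Hypothetical key fields selected by a client on one sample document.
model = build_graph({
    "item": (10, 10, 50, 10),
    "qty": (100, 10, 30, 10),
    "price": (200, 10, 40, 10),
})

# The same layout found elsewhere on another page.
shifted = build_graph({
    "item": (20, 300, 50, 10),
    "qty": (110, 300, 30, 10),
    "price": (210, 300, 40, 10),
})

# A layout where the quantity column sits left of the item column.
scrambled = build_graph({
    "item": (100, 300, 50, 10),
    "qty": (10, 300, 30, 10),
    "price": (210, 300, 40, 10),
})

print(matches(model, shifted), matches(model, scrambled))
```

Because the nodes carry labels, the check is a single pass over the model's labelled pairs, which is quadratic in the number of key fields rather than exponential.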
Document type:
Conference paper
João M. Sanches, Luisa Micó, Jaime S. Cardoso. IbPRIA 2013: 6th Iberian Conference on Pattern Recognition and Image Analysis, Jun 2013, Madeira, Portugal. Springer, 2013, Pattern Recognition and Image Analysis

Cited literature [14 references]

https://hal.inria.fr/hal-00788323
Contributor: Santosh K.C.
Submitted on: Thursday, 14 February 2013 - 18:51:07
Last modified on: Thursday, 11 January 2018 - 06:25:25
Document(s) archived on: Wednesday, 15 May 2013 - 03:57:16

File

kc_IbPRIA_CR0.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00788323, version 1

Citation

Santosh K.C., Abdel Belaïd. Pattern-Based Approach to Table Extraction. João M. Sanches, Luisa Micó, Jaime S. Cardoso. IbPRIA 2013: 6th Iberian Conference on Pattern Recognition and Image Analysis, Jun 2013, Madeira, Portugal. Springer, 2013, Pattern Recognition and Image Analysis. 〈hal-00788323〉
