Skip to Main content Skip to Navigation
Conference papers

Pattern-Based Approach to Table Extraction

Santosh K.C. 1, * Abdel Belaïd 1 
* Corresponding author
1 READ - Recognition of writing and analysis of documents
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we address a client-driven approach to automatically extract information content within the table in document images. We start with a graph-based representation of a set of key-fields selected by clients and perform graph mining in a document in order to learn them to produce a model. Such models are aimed to use to extract information content in the absence of clients. To avoid NP-hard general problem, our graph matching is based on relation assignment to see whether pairs of nodes are semantically identical. We have validated the concept by using a real-world industrial problem.
Complete list of metadata

Cited literature [14 references]  Display  Hide  Download
Contributor : Santosh K.C. Connect in order to contact the contributor
Submitted on : Thursday, February 14, 2013 - 6:51:07 PM
Last modification on : Saturday, October 16, 2021 - 11:26:09 AM
Long-term archiving on: : Wednesday, May 15, 2013 - 3:57:16 AM


Files produced by the author(s)


  • HAL Id : hal-00788323, version 1



Santosh K.C., Abdel Belaïd. Pattern-Based Approach to Table Extraction. IbPRIA 2013: 6th Iberian Conference on Pattern Recognition and Image Analysis, Jun 2013, Madeira, Portugal. ⟨hal-00788323⟩



Record views


Files downloads