Conference papers

Pattern-Based Approach to Table Extraction

Santosh K.C. 1, *, Abdel Belaïd 1
* Corresponding author
1 READ - Recognition of writing and analysis of documents
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we present a client-driven approach to automatically extracting the information content of tables in document images. We start with a graph-based representation of a set of key fields selected by clients and perform graph mining on a document to learn them and produce a model. Such models are then used to extract the information content in the absence of clients. To avoid the NP-hard general graph matching problem, our graph matching is based on relation assignment, which checks whether pairs of nodes are semantically identical. We have validated the concept on a real-world industrial problem.
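
The relation-based matching mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Field` class, the coarse left/right/above/below relation set, the tolerance `tol`, and the assumption that each key-field label occurs once per document are all hypothetical choices made for illustration. The idea shown is that comparing the relations of labelled node pairs needs only one pass over the pairs, rather than a search over node permutations.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass(frozen=True)
class Field:
    label: str  # hypothetical semantic label of a key field, e.g. "qty"
    x: float    # centre of the field's bounding box (image coordinates)
    y: float    # y grows downward


def relation(a, b, tol=5.0):
    """Coarse spatial relation of b with respect to a (assumed relation set)."""
    dx, dy = b.x - a.x, b.y - a.y
    horiz = "aligned" if abs(dx) <= tol else ("right" if dx > 0 else "left")
    vert = "aligned" if abs(dy) <= tol else ("below" if dy > 0 else "above")
    return horiz, vert


def build_pattern(fields, tol=5.0):
    """Map every ordered pair of labels to the relation between the two fields."""
    return {(a.label, b.label): relation(a, b, tol)
            for a, b in permutations(fields, 2)}


def matches(pattern, fields, tol=5.0):
    """True if every relation stored in the pattern also holds among `fields`."""
    observed = build_pattern(fields, tol)
    return all(observed.get(pair) == rel for pair, rel in pattern.items())


# Key fields a client might select in one sample document (made-up coordinates).
selected = [Field("item", 10, 100), Field("qty", 200, 100), Field("total", 200, 300)]
pattern = build_pattern(selected)

# Same layout at a different position and scale, and a layout with "total" moved up.
same_layout = [Field("item", 30, 50), Field("qty", 250, 52), Field("total", 251, 400)]
other_layout = [Field("item", 30, 400), Field("qty", 250, 402), Field("total", 251, 50)]

print(matches(pattern, same_layout))
print(matches(pattern, other_layout))
```

Because the pattern stores relations rather than absolute positions, the first layout is accepted despite the shift in coordinates, while the second is rejected as soon as one pairwise relation differs.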

Cited literature [14 references]

https://hal.inria.fr/hal-00788323
Contributor : Santosh K.C.
Submitted on : Thursday, February 14, 2013 - 6:51:07 PM
Last modification on : Friday, January 15, 2021 - 5:42:02 PM
Long-term archiving on : Wednesday, May 15, 2013 - 3:57:16 AM

File

kc_IbPRIA_CR0.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00788323, version 1

Citation

Santosh K.C., Abdel Belaïd. Pattern-Based Approach to Table Extraction. IbPRIA 2013: 6th Iberian Conference on Pattern Recognition and Image Analysis, Jun 2013, Madeira, Portugal. ⟨hal-00788323⟩
