Conference paper, Year: 2013

Client-Driven Content Extraction Associated with Table

Abstract

The goal of the project is to extract the content of tables in document images based on learnt patterns. Real-world users, i.e., clients, first provide a set of key fields within the table that they consider important. These fields are used to build a graph whose nodes are labelled with semantics and other features, and whose edges are attributed with relations. This attributed relational graph (ARG) is then employed to mine similar graphs from a document image. Each mined graph represents one item within the table, and hence a set of such graphs composes a table. We have validated the concept on a real-world industrial problem.
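The abstract does not give implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each text field has already been located and given a semantic label, uses a coarse, hypothetical vocabulary of spatial relations as edge attributes, and mines matches with a plain backtracking search. The names `Node`, `relation`, `build_arg`, and `mine` are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    label: str  # semantic label of the field (assumed to be given)
    x: float    # left coordinate of the field's bounding box
    y: float    # top coordinate of the field's bounding box


def relation(a, b, tol=5.0):
    """Coarse spatial relation from a to b (hypothetical edge attribute)."""
    if abs(a.y - b.y) <= tol:
        return "left_of" if a.x < b.x else "right_of"
    return "above" if a.y < b.y else "below"


def build_arg(nodes):
    """Edges of the attributed relational graph: (i, j) -> relation."""
    return {
        (i, j): relation(a, b)
        for i, a in enumerate(nodes)
        for j, b in enumerate(nodes)
        if i != j
    }


def mine(pattern, candidates):
    """Return index tuples of candidate nodes that reproduce the pattern's
    node labels and pairwise relations."""
    p_edges = build_arg(pattern)
    matches = []

    def extend(assignment):
        k = len(assignment)
        if k == len(pattern):
            matches.append(tuple(assignment))
            return
        for c_idx, cand in enumerate(candidates):
            if c_idx in assignment or cand.label != pattern[k].label:
                continue
            # Every edge to an already assigned node must carry the same relation.
            if all(
                relation(candidates[assignment[i]], cand) == p_edges[(i, k)]
                for i in range(k)
            ):
                extend(assignment + [c_idx])

    extend([])
    return matches


# Key fields selected by the client on one sample table row.
pattern = [Node("reference", 0, 0), Node("quantity", 100, 0), Node("price", 200, 0)]

# Labelled fields found on a new page: a header plus two table rows.
page = [
    Node("reference", 0, 50), Node("quantity", 100, 50), Node("price", 200, 50),
    Node("reference", 0, 80), Node("quantity", 100, 80), Node("price", 200, 80),
    Node("header", 0, 20),
]

items = mine(pattern, page)  # one index tuple per mined table item
```

Each tuple in `items` corresponds to one mined graph, that is, one table item; the set of tuples makes up the table. A real system would need tolerant matching for OCR noise and missing fields, which this sketch omits.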
Main file
kc_mva2.pdf (579.87 KB)
Source: Files produced by the author(s)

Dates and versions

hal-00808678 , version 1 (05-04-2013)

Cite

Santosh K.C., Abdel Belaïd. Client-Driven Content Extraction Associated with Table. IAPR MVA - The Thirteenth IAPR International Conference on Machine Vision Applications - 2013, May 2013, Kyoto, Japan. ⟨hal-00808678⟩
