KEFT: Knowledge Extraction and Graph Building from Statistical Data Tables
Résumé
Data provided by statistical models are commonly represented by textual, tabular or graphical form in documents (reports, articles, posters and presentations). These documents are often available in PDF format. Even though it makes accessing a particular information more difficult, it is interesting to process the PDF documents directly. We present KEFT, a solution in the statistical domain and we describe the fully functional pipeline to constructing a knowledge graph by extracting entities and relations from statistical Data Tables. We showcase how this approach can be used to construct a knowledge graph from different statistical studies.