Skip to Main content Skip to Navigation
Conference papers

Parallel Quotient Summarization of RDF Graphs

Pawel Guzewicz 1, 2 Ioana Manolescu 1, 2
2 CEDAR - Rich Data Analytics at Cloud Scale
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], Inria Saclay - Ile de France
Abstract : Discovering the structure and content of an RDF graph is hard for human users, due to its heterogeneity, complexity, and possibly large size. One class of tools for this task are structural RDF graph summaries, which allow users to grasp the different connections between RDF graph nodes. RDFQuotient graph summaries are a brand of structural summaries we developed. They are usually very compact, making them good for first-sight visual discovery. Existing algorithms for building these summaries are centralized, and require the graph to fit in memory. Going beyond, in this work we present novel algorithms for building RDFQuotient summaries in a parallel, shared-nothing architecture. We instantiate our algorithms to Apache Spark platform; our experiments demonstrate the merit of our approach.
Document type :
Conference papers
Complete list of metadata

Cited literature [20 references]  Display  Hide  Download
Contributor : Pawel Guzewicz Connect in order to contact the contributor
Submitted on : Tuesday, April 23, 2019 - 10:07:05 AM
Last modification on : Friday, April 30, 2021 - 10:04:40 AM


Files produced by the author(s)



Pawel Guzewicz, Ioana Manolescu. Parallel Quotient Summarization of RDF Graphs. SBD 2019 - International Workshop on Semantic Big Data, Jun 2019, Amsterdam, Netherlands. ⟨10.1145/3323878.3325809⟩. ⟨hal-02106521⟩



Les métriques sont temporairement indisponibles