Conference papers

Revisiting RDF storage layouts for efficient query answering

Maxime Buron 1, 2 François Goasdoué 3 Ioana Manolescu 1, 2 Tayeb Merabti 1, 2 Marie-Laure Mugnier 4
1 CEDAR - Rich Data Analytics at Cloud Scale
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], Inria Saclay - Ile de France
3 SHAMAN - A Symbolic and Human-centric view of dAta MANagement
IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE
4 GRAPHIK - Graphs for Inferences on Knowledge
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : The performance of query answering in an RDF database strongly depends on the data layout, that is, the way data is split across persistent data structures. We consider answering Basic Graph Pattern Queries (BGPQs), in particular those that also have variables in class and property positions, in the presence of RDFS ontologies, both through data saturation and through query reformulation. We show that such demanding queries often lead to inefficient query answering on two popular storage layouts, called T and CP. We present novel query answering algorithms on the TCP layout, which combines T and CP. In exchange for occupying more storage space, e.g. on an inexpensive disk, TCP avoids the bad or even catastrophic performance that T and/or CP sometimes exhibit. We also introduce summary-based pruning, a novel technique based on existing RDF quotient summaries, which improves query answering performance on the T and CP layouts and on the more robust TCP layout.
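The abstract names the T and CP layouts without defining them. The sketch below is only an illustration under stated assumptions: T is taken to be a single table of (subject, property, object) triples, and CP one table per class plus one table per property. The sample triples and the helper names (build_t, build_cp, match_subject_t, match_subject_cp) are hypothetical, and the paper's actual algorithms are not reproduced here. It shows why a pattern with variables in class and property positions can be costly on a CP-style layout: every table has to be visited.

```python
from collections import defaultdict

RDF_TYPE = "rdf:type"

# Hypothetical sample data, not taken from the paper.
TRIPLES = [
    ("alice", RDF_TYPE, "Person"),
    ("alice", "worksFor", "inria"),
    ("bob", "knows", "alice"),
    ("inria", RDF_TYPE, "Organization"),
]


def build_t(triples):
    """Assumed T layout: one table holding every triple."""
    return list(triples)


def build_cp(triples):
    """Assumed CP layout: one table per class and one table per property."""
    classes = defaultdict(list)     # class name -> [subject]
    properties = defaultdict(list)  # property name -> [(subject, object)]
    for s, p, o in triples:
        if p == RDF_TYPE:
            classes[o].append(s)
        else:
            properties[p].append((s, o))
    return classes, properties


def match_subject_t(t, subject):
    """Answer the pattern (subject, ?p, ?o) on T with a single table scan."""
    return sorted((p, o) for s, p, o in t if s == subject)


def match_subject_cp(cp, subject):
    """Answer the same pattern on CP; returns (answers, tables visited)."""
    classes, properties = cp
    answers = [(RDF_TYPE, c) for c, members in classes.items()
               if subject in members]
    for p, pairs in properties.items():
        answers.extend((p, o) for s, o in pairs if s == subject)
    return sorted(answers), len(classes) + len(properties)


if __name__ == "__main__":
    print(match_subject_t(build_t(TRIPLES), "alice"))
    print(match_subject_cp(build_cp(TRIPLES), "alice"))
```

In this toy setting, a layout that stores both structures (as the abstract says TCP does) could route a pattern with an unbound property to the single table and a pattern with a fixed property to that property's table, at the price of storing the data twice.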

Cited literature [22 references]

https://hal.inria.fr/hal-02921457
Contributor : Maxime Buron
Submitted on : Tuesday, August 25, 2020 - 11:19:33 AM
Last modification on : Friday, April 30, 2021 - 10:04:24 AM
Long-term archiving on : Tuesday, December 1, 2020 - 7:08:30 AM

File

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02921457, version 1

Citation

Maxime Buron, François Goasdoué, Ioana Manolescu, Tayeb Merabti, Marie-Laure Mugnier. Revisiting RDF storage layouts for efficient query answering. SSWS 2020 - 13th International Workshop on Scalable Semantic Web Knowledge Base Systems, Nov 2020, Athens, Greece. ⟨hal-02921457⟩

Metrics

Record views

132

Files downloads

409