Scaling Large RDF Archives To Very Long Histories - Inria - National Institute for Research in Digital Science and Technology
Conference Paper. Year: 2023

Scaling Large RDF Archives To Very Long Histories

Abstract

In recent years, research in RDF archiving has gained traction due to the ever-growing nature of semantic data and the emergence of community-maintained knowledge bases. Several solutions have been proposed to manage the history of large RDF graphs, including approaches based on independent copies, time-based indexes, and change-based schemes. In particular, aggregated changesets have been shown to be relatively efficient at handling very large datasets. However, ingestion time can still become prohibitive as the revision history grows. To tackle this challenge, we propose a hybrid storage approach based on aggregated changesets, snapshots, and multiple delta chains. We evaluate different snapshot creation strategies on the BEAR benchmark for RDF archives, and show that our techniques can reduce ingestion time by up to two orders of magnitude while keeping competitive performance for version materialization and delta queries. This allows us to support revision histories of lengths that are beyond reach with existing approaches.
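
The storage idea summarized in the abstract can be illustrated with a small in-memory sketch. It is not the authors' implementation: the class names (`DeltaChain`, `HybridArchive`), the `chain_length` parameter, and the fixed-period snapshot strategy are hypothetical choices made for illustration, and triples are plain Python tuples rather than indexed RDF terms. Each delta chain holds one snapshot plus aggregated changesets, where every changeset is expressed relative to the chain's snapshot rather than to the previous revision.

```python
from bisect import bisect_right
from dataclasses import dataclass, field

# A triple is modelled as (subject, predicate, object) strings (assumption).
Triple = tuple[str, str, str]
Changeset = tuple[frozenset, frozenset]  # (additions, deletions)


@dataclass
class DeltaChain:
    """One snapshot plus aggregated changesets relative to that snapshot."""
    start_version: int
    snapshot: frozenset
    # deltas[i] describes version start_version + 1 + i.
    deltas: list = field(default_factory=list)

    def materialize(self, version: int) -> frozenset:
        offset = version - self.start_version
        if offset == 0:
            return self.snapshot
        added, deleted = self.deltas[offset - 1]
        # Aggregated changeset: one step from the snapshot, no replay needed.
        return (self.snapshot - deleted) | added


class HybridArchive:
    """Hypothetical archive that starts a new chain every `chain_length` versions."""

    def __init__(self, chain_length: int):
        if chain_length < 1:
            raise ValueError("chain_length must be at least 1")
        self.chain_length = chain_length
        self.chains: list[DeltaChain] = []
        self.num_versions = 0

    def ingest(self, triples) -> int:
        """Store a new revision and return its version number."""
        graph = frozenset(triples)
        version = self.num_versions
        chain = self.chains[-1] if self.chains else None
        if chain is None or len(chain.deltas) + 1 >= self.chain_length:
            # Snapshot strategy (assumed): fixed period. A fresh snapshot
            # keeps later changesets small, at the cost of extra storage.
            self.chains.append(DeltaChain(version, graph))
        else:
            chain.deltas.append((graph - chain.snapshot, chain.snapshot - graph))
        self.num_versions += 1
        return version

    def materialize(self, version: int) -> frozenset:
        """Return the full set of triples at the given version."""
        if not 0 <= version < self.num_versions:
            raise IndexError(f"unknown version {version}")
        starts = [c.start_version for c in self.chains]
        chain = self.chains[bisect_right(starts, version) - 1]
        return chain.materialize(version)

    def delta(self, v_from: int, v_to: int) -> Changeset:
        """Return (additions, deletions) between two versions."""
        old, new = self.materialize(v_from), self.materialize(v_to)
        return new - old, old - new
```

In this sketch, `HybridArchive(chain_length=3)` stores versions 0 and 3 as snapshots and versions 1, 2, and 4 as aggregated changesets. Because a changeset is always computed against the nearest preceding snapshot, its size depends on the distance to that snapshot rather than on the length of the whole history, which is the intuition behind bounding ingestion cost with multiple delta chains. The delta query here simply diffs two materialized versions; a real system would likely answer it from its indexes.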
Main file
ICSC_2023.pdf (710.96 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04388912, version 1 (11-01-2024)

License

Attribution

Cite

Olivier Pelgrin, Ruben Taelman, Luis Galárraga, Katja Hose. Scaling Large RDF Archives To Very Long Histories. ICSC 2023 - IEEE 17th International Conference on Semantic Computing, Feb 2023, Laguna Hills, United States. pp.41-48, ⟨10.1109/ICSC56153.2023.00013⟩. ⟨hal-04388912⟩