A Machine Learning Approach to SPARQL Query Performance Prediction

Rakebul Hasan 1 Fabien Gandon 1
1 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : In this paper we address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. Traditional approaches for estimating SPARQL query cost are based on statistics about the underlying data. However, in many use-cases involving querying Linked Data, statistics about the underlying data are often missing. Our approach does not require any statistics about the underlying RDF data, which makes it ideal for the Linked Data scenario. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately predict SPARQL query execution time.
Document type :
Conference papers
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/hal-01075484
Contributor : Rakebul Hasan <>
Submitted on : Friday, October 17, 2014 - 4:57:04 PM
Last modification on : Monday, November 5, 2018 - 3:52:09 PM
Long-term archiving on : Sunday, January 18, 2015 - 10:41:37 AM

File

bare_conf.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01075484, version 1

Collections

Citation

Rakebul Hasan, Fabien Gandon. A Machine Learning Approach to SPARQL Query Performance Prediction. The 2014 IEEE/WIC/ACM International Conference on Web Intelligence, Aug 2014, Warsaw, Poland. ⟨hal-01075484⟩

Share

Metrics

Record views

505

Files downloads

907