Résumé : The Semantic Web standardized by the World Wide Web Consortium aims at providing a common framework that allows data to be shared and analyzed across applications. The Resource Description Framework (RDF) and the query language SPARQL constitute two major components of this vision.
Because of the increasing amounts of RDF data available, dataset distribution across clusters is poised to become a standard storage method. As a consequence, efficient and distributed sparql evaluators are needed.
To tackle these needs, we first benchmark several state-of-the-art distributed SPARQL evaluators while monitoring a set of metrics which is appropriate in a distributed context (e.g. network traffic). Then, an analysis driven by typical use cases leads us to define new development perspectives in the field of distributed sparql evaluation. On the basis of these perspectives, we design several efficient distributed SPARQL evaluators whose performances are validated and compared to state- of-the-art evaluators. For instance, our distributed SPARQL evaluator named SPARQLGX offers efficient time performances while being resilient to the loss of nodes.