gMark: Schema-Driven Generation of Graphs and Queries

Guillaume Bagan 1 Angela Bonifati 2, 3 Radu Ciucanu 4, 5 George Fletcher 6 Aurélien Lemay 7 Nicky Advokaat 6
1 GOAL - Graphes, AlgOrithmes et AppLications
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
2 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
7 LINKS - Linking Dynamic Data
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189
Abstract : Massive graph data sets are pervasive in contemporary application domains. Hence, graph database systems are becoming increasingly important. In the experimental study of these systems, it is vital that the research community has shared solutions for the generation of database instances and query workloads having predictable and controllable properties. We present the design and engineering principles of gMark, a domain- and query language-independent graph instance and query workload generator. A core contribution of gMark is its ability to target and control the diversity of properties of both the generated instances and the generated workloads coupled to these instances. Further novelties include support for regular path queries, a fundamental graph query paradigm, and schema-driven selectivity estimation of queries, a key feature in controlling workload chokepoints. We illustrate the flexibility and practical usability of gMark by showcasing the framework's capabilities in generating high quality graphs and workloads, and its ability to encode user-defined schemas across a variety of application domains.
Type de document :
Communication dans un congrès
Data Engineering (ICDE), 2017 IEEE 33rd International Conference on, Apr 2017, San Diego, United States. 〈http://ieeexplore.ieee.org/document/7929934/〉. 〈10.1109/ICDE.2017.38〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01591706
Contributeur : Radu Ciucanu <>
Soumis le : jeudi 21 septembre 2017 - 19:14:17
Dernière modification le : mercredi 25 avril 2018 - 15:42:38

Lien texte intégral

Identifiants

Citation

Guillaume Bagan, Angela Bonifati, Radu Ciucanu, George Fletcher, Aurélien Lemay, et al.. gMark: Schema-Driven Generation of Graphs and Queries. Data Engineering (ICDE), 2017 IEEE 33rd International Conference on, Apr 2017, San Diego, United States. 〈http://ieeexplore.ieee.org/document/7929934/〉. 〈10.1109/ICDE.2017.38〉. 〈hal-01591706〉

Partager

Métriques

Consultations de la notice

320