Navigating the Maze of Wikidata Query Logs - Archive ouverte HAL Access content directly
Conference Papers Year : 2019

Navigating the Maze of Wikidata Query Logs

(1, 2) , (3) , (3)
1
2
3

Abstract

This paper provides an in-depth and diversified analysis of the Wikidata query logs, recently made publicly available. Although the usage of Wikidata queries has been the object of recent studies, our analysis of the query traffic reveals interesting and unforeseen findings concerning the usage, types of recursion, and the shape classification of complex recursive queries. Wikidata specific features combined with recursion let us identify a significant subset of the entire corpus that can be used by the community for further assessment. We considered and analyzed the queries across many different dimensions, such as the robotic and organic queries, the presence/absence of constants along with the correctly executed and timed out queries. A further investigation that we pursue in this paper is to find, given a query, a number of queries structurally similar to the given query. We provide a thorough characterization of the queries in terms of their expressive power, their topological structure and shape, along with a deeper understanding of the usage of recursion in these logs. We make the code for the analysis available as open source.
Fichier principal
Vignette du fichier
3308558.3313472.pdf (988.34 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02096714 , version 1 (25-09-2020)

Identifiers

Cite

Angela Bonifati, Wim Martens, Thomas Timm. Navigating the Maze of Wikidata Query Logs. WWW 2019 - The World Wide Web Conference, May 2019, San Francisco, United States. pp.127-138, ⟨10.1145/3308558.3313472⟩. ⟨hal-02096714⟩
393 View
383 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More