Skip to Main content Skip to Navigation
Conference papers

Navigating the Maze of Wikidata Query Logs

Angela Bonifati 1, 2 Wim Martens 3 Thomas Timm 3
1 BD - Base de Données
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
2 TYREX - Types and Reasoning for the Web
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : This paper provides an in-depth and diversified analysis of the Wikidata query logs, recently made publicly available. Although the usage of Wikidata queries has been the object of recent studies, our analysis of the query traffic reveals interesting and unforeseen findings concerning the usage, types of recursion, and the shape classification of complex recursive queries. Wikidata specific features combined with recursion let us identify a significant subset of the entire corpus that can be used by the community for further assessment. We considered and analyzed the queries across many different dimensions, such as the robotic and organic queries, the presence/absence of constants along with the correctly executed and timed out queries. A further investigation that we pursue in this paper is to find, given a query, a number of queries structurally similar to the given query. We provide a thorough characterization of the queries in terms of their expressive power, their topological structure and shape, along with a deeper understanding of the usage of recursion in these logs. We make the code for the analysis available as open source.
Document type :
Conference papers
Complete list of metadata

Cited literature [32 references]  Display  Hide  Download
Contributor : Tyrex Equipe Connect in order to contact the contributor
Submitted on : Friday, September 25, 2020 - 11:54:12 AM
Last modification on : Wednesday, November 3, 2021 - 6:45:53 AM
Long-term archiving on: : Thursday, December 3, 2020 - 5:44:04 PM


Files produced by the author(s)



Angela Bonifati, Wim Martens, Thomas Timm. Navigating the Maze of Wikidata Query Logs. WWW 2019 - The World Wide Web Conference, May 2019, San Francisco, United States. pp.127-138, ⟨10.1145/3308558.3313472⟩. ⟨hal-02096714⟩



Record views


Files downloads