An Indexing Framework for Queries on Probabilistic Graphs

Abstract : Information in many applications, such as mobile wireless systems, social networks, and road networks, is captured by graphs. In many cases, such information is uncertain. We study the problem of querying a probabilistic graph, in which vertices are connected to each other probabilistically. In particular, we examine “source-to-target” queries (or ST-queries), such as computing the shortest path between two vertices. The major difference with the deterministic setting is that query answers are enriched with probabilistic annotations. Evaluating ST-queries over probabilistic graphs is #P-hard, as it requires examining an exponential number of “possible worlds” – database instances generated from the probabilistic graph. Existing solutions to the ST-query problem, which sample possible worlds, have two downsides: (i) a possible world can be very large and (ii) many samples are needed for reasonable accuracy. To tackle these issues, we study the ProbTree, a data structure that stores a succinct, or indexed, version of the possible worlds of the graph. Existing ST-query solutions are executed on top of this structure, with the number of samples and sizes of the possible worlds reduced. We examine lossless and lossy methods for generating the ProbTree, which reflect the trade-off between the accuracy and efficiency of query evaluation. We analyze the correctness and complexity of these approaches. Our extensive experiments on real datasets show that the ProbTree is fast to generate and small in size. It also enhances the accuracy and efficiency of existing ST-query algorithms significantly.
Document type :
Journal articles
Complete list of metadatas

Cited literature [47 references]  Display  Hide  Download

https://hal.inria.fr/hal-01437580
Contributor : Pierre Senellart <>
Submitted on : Tuesday, January 17, 2017 - 1:29:05 PM
Last modification on : Thursday, February 7, 2019 - 2:42:37 PM

File

maniu2017indexing.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01437580, version 1

Citation

Silviu Maniu, Reynold Cheng, Pierre Senellart. An Indexing Framework for Queries on Probabilistic Graphs. ACM Trans. Datab. Syst, 2017. ⟨hal-01437580⟩

Share

Metrics

Record views

269

Files downloads

206