inria-00137138, version 2
On the Complexity of Managing Probabilistic XML Data
Pierre Senellart
1Serge Abiteboul
1
Principles Of Database Systems (2007)
Résumé : In [3], we introduced a framework for querying and updating probabilistic information over unordered labeled trees, the probabilistic tree model. The data model is based on trees where nodes are annotated with conjunctions of probabilistic event variables. We briefly described an implementation and scenarios of usage. We develop here a mathematical foundation for this model. In particular, we present complexity results. We identify a very large class of queries for which simple variations of querying and updating algorithms from [3] compute the correct answer. A main contribution is a full complexity analysis of queries and updates. We also exhibit a decision procedure for the equivalence of probabilistic trees and prove it is in co-RP. Furthermore, we study the issue of removing less probable possible worlds, and that of validating a probabilistic tree against a DTD. We show that these two problems are intractable in the most general case.
- 1 : GEMO (INRIA Futurs)
- INRIA – CNRS : UMR8623 – Université Paris XI - Paris Sud
- Domaine : Informatique/Base de données
- Mots-clés : XML – probabilistic databases – semi-structured databases – complexity
- Versions disponibles : v1 (16-03-2007) v2 (21-03-2007)
- inria-00137138, version 2
- http://hal.inria.fr/inria-00137138
- oai:hal.inria.fr:inria-00137138
- Contributeur : Pierre Senellart
- Soumis le : Mercredi 21 Mars 2007, 20:54:23
- Dernière modification le : Jeudi 26 Avril 2007, 23:25:53






Documents associés
Exporter