Querying Regular Sets of XML Documents

Slawomir Staworko; Emmanuel Filiot; Jan Chomicki

Communication Dans Un Congrès Année : 2008

Querying Regular Sets of XML Documents

(1) , (1) , (2)

1
2

Slawomir Staworko

Fonction : Auteur
PersonId : 5221
IdHAL : staworko
ORCID : 0000-0003-3684-3395
IdRef : 169337472

Modeling Tree Structures, Machine Learning, and Information Extraction

Emmanuel Filiot

Fonction : Auteur
PersonId : 848587

Modeling Tree Structures, Machine Learning, and Information Extraction

Jan Chomicki

Fonction : Auteur
PersonId : 848676

CSE at Buffalo

Résumé

We investigate the problem of querying (regular) sets of XML documents represented with tree automata and we consider $n$-ary tree automata queries whose expressive power captures MSO on trees. Because finite automata can represent infinite sets of documents, we propose the notions of {\em universal} and {\em existential} query answers, answers that are present resp. in all and some documents. We study complexity of query answering and show that computing existential query answers is in PTIME if we assume the arity of the query to be a fixed parameter. On the other hand, computing universal query answers is EXPTIME-complete, but we show that it is in PTIME if we assume that the query is fixed (data complexity). Finally, we argue that the framework captures problems central to many novel XML applications like querying inconsistent XML documents. In particular, we demonstrate how to use our framework to compute consistent query answers in XML documents that do not satisfy the schema. This solution significantly extends our previous results in this area.

Domaines

Base de données [cs.DB]

Fichier principal

staworko-lid08.pdf (194.61 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Slawomir Staworko : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00275491

Soumis le : jeudi 24 avril 2008-10:45:39

Dernière modification le : vendredi 24 mars 2023-14:52:50

Archivage à long terme le : vendredi 28 septembre 2012-13:01:22

Dates et versions

inria-00275491 , version 1 (24-04-2008)

Identifiants

HAL Id : inria-00275491 , version 1

Citer

Slawomir Staworko, Emmanuel Filiot, Jan Chomicki. Querying Regular Sets of XML Documents. International Workshop on Logic in Databases (LiD), May 2008, Rome, Italy. ⟨inria-00275491⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA MOSTRARE INRIA2

117 Consultations

53 Téléchargements

Querying Regular Sets of XML Documents

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager