Theses

Cross-Model Queries and Schemas: Complexity and Learning

Radu Ciucanu 1, 2
1 LINKS - Linking Dynamic Data
CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189, Inria Lille - Nord Europe
Abstract : Specifying a database query in a formal query language is typically challenging for non-expert users. In the context of big data, the problem becomes even harder because users must deal with database instances that are large and hence difficult to visualize. Such instances usually lack a schema to help users specify their queries, or have an incomplete schema because they come from disparate data sources. In this thesis, we address the problem of query specification for non-expert users. We identify two possible approaches to this problem: learning queries from examples, and translating the data into a format that the user finds easier to query. Our contributions are aligned with these two complementary directions and span three of the most popular data models: XML, relational, and graph. The thesis consists of two parts, dedicated to (i) schema definition and translation, and (ii) learning schemas and queries. In the first part, we define schema formalisms for unordered XML and analyze their computational properties; we also study the complexity of the data exchange problem in the setting of a relational source and a graph target database. In the second part, we investigate the problem of learning from examples the schemas for unordered XML proposed in the first part, as well as relational join queries and path queries on graph databases. The interactive scenario that we propose for these two classes of queries is immediately applicable to assisting non-expert users in the process of query specification.
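To make "learning relational join queries from examples" concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the algorithm from the thesis: it assumes the target query is an equi-join (a conjunction of attribute equalities), that tuples are plain dictionaries, and that the user has labeled some tuple pairs as wanted (positive) or unwanted (negative). The function names are hypothetical.

```python
from itertools import product


def most_specific_join(r_attrs, s_attrs, positives):
    """Keep every (r_attr, s_attr) equality that holds on all positive pairs."""
    candidates = set(product(r_attrs, s_attrs))
    for r, s in positives:
        candidates = {(a, b) for (a, b) in candidates if r[a] == s[b]}
    return candidates


def selects(predicate, r, s):
    """True if the pair (r, s) satisfies every equality in the predicate."""
    return all(r[a] == s[b] for a, b in predicate)


def learn_join(r_attrs, s_attrs, positives, negatives):
    """Return an equi-join predicate consistent with the labels, or None.

    The most specific predicate selects the fewest pairs among all
    predicates that accept the positives, so if it still selects a
    negative pair, no equi-join predicate fits the labels.
    """
    predicate = most_specific_join(r_attrs, s_attrs, positives)
    if any(selects(predicate, r, s) for r, s in negatives):
        return None
    return predicate
```

For instance, with flight tuples over the attributes (from, to, airline) and hotel tuples over (city, discount), labeling as positive two pairs in which the flight destination equals the hotel city and the airline equals the discount partner yields the predicate to = city and airline = discount. In an interactive setting, a system would then ask the user to label further pairs until the examples pin down the intended query.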
Document type : Theses

Cited literature [61 references]

https://hal.inria.fr/tel-01182649
Contributor : Radu Ciucanu
Submitted on : Saturday, August 1, 2015 - 6:12:19 PM
Last modification on : Wednesday, March 23, 2022 - 3:51:21 PM
Long-term archiving on : Monday, November 2, 2015 - 10:14:14 AM

Identifiers

  • HAL Id : tel-01182649, version 1

Citation

Radu Ciucanu. Cross-Model Queries and Schemas: Complexity and Learning. Databases [cs.DB]. Université Lille 1 - Sciences et Technologies, 2015. English. ⟨tel-01182649⟩

Metrics

Record views : 415
File downloads : 323