Data Exchange with MapReduce: A First Cut

Abstract : Data exchange is one of the oldest database problems, being of both practical and theoretical interest. Given the pace at which heterogeneous data are published on the web, thanks to initiatives such as Linked Data and Open Science, scalability of data exchange becomes crucial. Pivotal to data exchange is the chase algorithm, which is a fixpoint algorithm to evaluate both source-to-target constraints and target constraints in the data exchange process. In this paper, we investigate how new programming models such as MapReduce can be used to implement the chase on large-scale data sources. To the best of our knowledge, how to exchange data at scale has not been investigated so far. We present an initial solution for chasing source-to-target tuple generating dependencies and target tuple-generating dependencies, and discuss open issues that need to be addressed to leverage MapReduce for the data exchange problem.
Type de document :
Communication dans un congrès
International Conference on Scientific and Statistical Database Management (SSDBM), Jul 2016, Budapest, Hungary. pp.4, Proceedings of the 28th International Conference on Scientific and Statistical Database Management (SSDBM). 〈10.1145/2949689.2949702〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01401594
Contributeur : Angela Bonifati <>
Soumis le : mercredi 23 novembre 2016 - 15:50:11
Dernière modification le : jeudi 19 avril 2018 - 14:38:05

Identifiants

Citation

Khalid Belhajjame, Angela Bonifati. Data Exchange with MapReduce: A First Cut. International Conference on Scientific and Statistical Database Management (SSDBM), Jul 2016, Budapest, Hungary. pp.4, Proceedings of the 28th International Conference on Scientific and Statistical Database Management (SSDBM). 〈10.1145/2949689.2949702〉. 〈hal-01401594〉

Partager

Métriques

Consultations de la notice

209