Pay-As-You-Go Data Integration Using Functional Dependencies

Naser Ayat 1 Hamideh Afsarmanesh 1 Reza Akbarinia 2 Patrick Valduriez 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration based on Functional Dependencies), a pay-as-you-go data integration system that allows integrating a given set of data sources, as well as incrementally integrating additional sources. IFD takes advantage of the background knowledge implied within functional dependencies for matching the source schemas. Our system is built on a probabilistic data model that allows capturing the uncertainty in data integration systems. Our performance evaluation results show significant performance gains of our approach in terms of recall and precision compared to the baseline approaches. They confirm the importance of functional dependencies and also the contribution of using a probabilistic data model in improving the quality of schema matching. The analytical study and experiments show that IFD scales well.
Type de document :
Communication dans un congrès
Gerald Quirchmayr; Josef Basl; Ilsun You; Lida Xu; Edgar Weippl. International Cross-Domain Conference and Workshop on Availability, Reliability, and Security (CD-ARES), Aug 2012, Prague, Czech Republic. Springer, Lecture Notes in Computer Science, LNCS-7465, pp.375-389, 2012, Multidisciplinary Research and Practice for Information Systems. 〈10.1007/978-3-642-32498-7_28〉
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01542460
Contributeur : Hal Ifip <>
Soumis le : lundi 19 juin 2017 - 17:01:39
Dernière modification le : jeudi 24 mai 2018 - 15:59:21
Document(s) archivé(s) le : vendredi 15 décembre 2017 - 22:21:30

Fichier

978-3-642-32498-7_28_Chapter.p...
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Citation

Naser Ayat, Hamideh Afsarmanesh, Reza Akbarinia, Patrick Valduriez. Pay-As-You-Go Data Integration Using Functional Dependencies. Gerald Quirchmayr; Josef Basl; Ilsun You; Lida Xu; Edgar Weippl. International Cross-Domain Conference and Workshop on Availability, Reliability, and Security (CD-ARES), Aug 2012, Prague, Czech Republic. Springer, Lecture Notes in Computer Science, LNCS-7465, pp.375-389, 2012, Multidisciplinary Research and Practice for Information Systems. 〈10.1007/978-3-642-32498-7_28〉. 〈hal-01542460〉

Partager

Métriques

Consultations de la notice

239

Téléchargements de fichiers

33