On Leveraging Crowdsourcing Techniques for Schema Matching Networks

Abstract : As the number of publicly-available datasets are likely to grow, the demand of establishing the links between these datasets is also getting higher and higher. For creating such links we need to match their schemas. Moreover, for using these datasets in meaningful ways, one often needs to match not only two, but several schemas. This matching process establishes a (potentially large) set of attribute correspondences between multiple schemas that constitute a schema matching network. Various commercial and academic schema matching tools have been developed to support this task. However, as the matching is inherently uncertain, the heuristic techniques adopted by these tools give rise to results that are not completely correct. Thus, in practice, a post-matching human expert effort is needed to obtain a correct set of attribute correspondences. Addressing this problem, our paper demonstrates how to leverage crowdsourcing techniques to validate the generated correspondences. We design validation questions with contextual information that can effectively guide the crowd workers. We analyze how to reduce overall human effort needed for this validation task. Through theoretical and empirical results, we show that by harnessing natural constraints defined on top of the schema matching network, one can significantly reduce the necessary human work.
Type de document :
Communication dans un congrès
The 18th International Conference on Database Systems for Advanced Applications, Apr 2013, Wuhan, China. 2013
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00812037
Contributeur : Zoltan Miklos <>
Soumis le : jeudi 11 avril 2013 - 15:39:48
Dernière modification le : lundi 2 octobre 2017 - 16:06:04
Document(s) archivé(s) le : lundi 3 avril 2017 - 04:27:55

Fichier

dasfaa.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00812037, version 1

Citation

Hung Nguyen Quoc Viet, Tam Nguyen Thanh, Zoltan Miklos, Karl Aberer. On Leveraging Crowdsourcing Techniques for Schema Matching Networks. The 18th International Conference on Database Systems for Advanced Applications, Apr 2013, Wuhan, China. 2013. 〈hal-00812037〉

Partager

Métriques

Consultations de
la notice

277

Téléchargements du document

382