An automatic key discovery approach for data linking

Abstract : In the context of Linked Data, different kinds of semantic links can be established between data. However when data sources are huge, detecting such links manually is not feasible. One of the most important types of links, the identity link, expresses that different identifiers refer to the same real world entity. Some automatic data linking approaches use keys to infer identity links, nevertheless this kind of knowledge is rarely available. In this work we propose KD2R, an approach which allows the automatic discovery of composite keys in RDF data sources that may conform to different schemas. We only consider data sources for which the Unique Name Assumption is fulfilled. The obtained keys are correct with respect to the RDF data sources in which they are discovered. The proposed algorithm is scalable since it allows the key discovery without having to scan all the data. KD2R has been tested on real datasets of the international contest OAEI 2010 and on data sets available on the web of data, and has obtained promising results.
Type de document :
Article dans une revue
Journal of Web Semantics, Elsevier, 2013, 23, pp.16--30. 〈10.1016/j.websem.2013.07.001〉
Liste complète des métadonnées

Littérature citée [30 références]  Voir  Masquer  Télécharger
Contributeur : Fatiha Saïs <>
Soumis le : vendredi 19 février 2016 - 16:25:09
Dernière modification le : mardi 24 avril 2018 - 13:54:43
Document(s) archivé(s) le : vendredi 20 mai 2016 - 11:44:49


Fichiers produits par l'(les) auteur(s)




Nathalie Pernelle, Fatiha Saïs, Danai Symeonidou. An automatic key discovery approach for data linking. Journal of Web Semantics, Elsevier, 2013, 23, pp.16--30. 〈10.1016/j.websem.2013.07.001〉. 〈hal-01276599〉



Consultations de la notice


Téléchargements de fichiers