Skip to Main content Skip to Navigation
Journal articles

An automatic key discovery approach for data linking

Abstract : In the context of Linked Data, different kinds of semantic links can be established between data. However when data sources are huge, detecting such links manually is not feasible. One of the most important types of links, the identity link, expresses that different identifiers refer to the same real world entity. Some automatic data linking approaches use keys to infer identity links, nevertheless this kind of knowledge is rarely available. In this work we propose KD2R, an approach which allows the automatic discovery of composite keys in RDF data sources that may conform to different schemas. We only consider data sources for which the Unique Name Assumption is fulfilled. The obtained keys are correct with respect to the RDF data sources in which they are discovered. The proposed algorithm is scalable since it allows the key discovery without having to scan all the data. KD2R has been tested on real datasets of the international contest OAEI 2010 and on data sets available on the web of data, and has obtained promising results.
Complete list of metadata

Cited literature [30 references]  Display  Hide  Download
Contributor : Fatiha Saïs Connect in order to contact the contributor
Submitted on : Friday, February 19, 2016 - 4:25:09 PM
Last modification on : Sunday, June 26, 2022 - 12:05:13 PM
Long-term archiving on: : Friday, May 20, 2016 - 11:44:49 AM


Files produced by the author(s)




Nathalie Pernelle, Fatiha Saïs, Danai Symeonidou. An automatic key discovery approach for data linking. Journal of Web Semantics, Elsevier, 2013, 23, pp.16--30. ⟨10.1016/j.websem.2013.07.001⟩. ⟨hal-01276599⟩



Record views


Files downloads