Journal articles

Combining a Logical and a Numerical Method for Data Reconciliation

Fatiha Saïs 1, 2 Nathalie Pernelle 1, 2 Marie-Christine Rousset 3, 4
2 GEMO - Integration of data and knowledge distributed over the web
LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France, CNRS - Centre National de la Recherche Scientifique : UMR8623
Abstract : The reference reconciliation problem consists in deciding whether different identifiers refer to the same data, i.e. correspond to the same real-world entity. In this article we present a reference reconciliation approach that combines a logical method, called L2R, with a numerical one, called N2R. The approach exploits the semantics of the schema and of the data. In L2R, these semantics are translated into a set of first-order Horn rules of reconciliation, which are used to infer exact decisions of both reconciliation and non-reconciliation. In N2R, the semantics of the schema are translated into an informed similarity measure, which is used to compute numerically the similarity of reference pairs. This similarity measure is expressed as a nonlinear equation system, which is solved with an iterative method. Experiments with the two methods on two different domains show good results for both recall and precision. The methods can be used separately or in combination, and we show that combining them improves runtime performance.
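The iterative resolution mentioned in the abstract can be illustrated with a small sketch. This is not the N2R similarity measure itself, whose exact form is not given in this record. It only assumes that the similarity of a reference pair depends on a fixed attribute-level score and on the similarity of related reference pairs, which makes the system nonlinear (here through a `max`). The function name `iterate_similarity`, the `weight` parameter, and the update formula are hypothetical choices made for illustration.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]  # a pair of reference identifiers


def iterate_similarity(
    base: Dict[Pair, float],
    depends_on: Dict[Pair, List[Pair]],
    weight: float = 0.5,
    eps: float = 1e-6,
    max_iter: int = 100,
) -> Dict[Pair, float]:
    """Solve sim(p) = (1 - weight) * base[p] + weight * max(sim(q) for q related to p)
    by repeated substitution. This formula is an illustrative assumption,
    not the measure defined in the article."""
    sim = dict(base)  # start from the attribute-level scores
    for _ in range(max_iter):
        new_sim = {}
        for pair, score in base.items():
            related = depends_on.get(pair, [])
            if related:
                # Nonlinear term: best similarity among related pairs.
                best = max(sim[q] for q in related)
                new_sim[pair] = (1 - weight) * score + weight * best
            else:
                new_sim[pair] = score
        # Largest change in this sweep; 0.0 when there are no pairs at all.
        delta = max((abs(new_sim[p] - sim[p]) for p in base), default=0.0)
        sim = new_sim
        if delta < eps:
            break
    return sim


# Hypothetical data: two museum references located in two city references.
museums: Pair = ("museum1", "museum2")
cities: Pair = ("city1", "city2")
scores = iterate_similarity(
    base={museums: 0.6, cities: 0.9},
    depends_on={museums: [cities], cities: [museums]},
)
```

With `weight` below 1 each sweep shrinks the largest change by at least that factor, so the loop settles on a fixed point. In the example the two pairs pull each other toward intermediate values (about 0.7 for the museums and 0.8 for the cities).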
Document type :
Journal articles

Cited literature [34 references]

Contributor : Fatiha Saïs
Submitted on : Wednesday, November 18, 2009 - 10:47:41 AM
Last modification on : Friday, February 4, 2022 - 3:08:58 AM
Long-term archiving on : Thursday, June 17, 2010 - 6:49:46 PM


Files produced by the author(s)


  • HAL Id : inria-00433007, version 1


Fatiha Saïs, Nathalie Pernelle, Marie-Christine Rousset. Combining a Logical and a Numerical Method for Data Reconciliation. Journal on Data Semantics, Springer, 2009, 12 (12), pp.66-94. ⟨inria-00433007⟩


