Combining a Logical and a Numerical Method for Data Reconciliation

The reference reconciliation problem consists in deciding whether different identifiers refer to the same data, i.e. correspond to the same real-world entity. In this article we present a reference reconciliation approach that combines a logical method for reference reconciliation, called L2R, and a numerical one, called N2R. This approach exploits the schema and data semantics, which are translated into a set of Horn FOL rules of reconciliation. These rules are used in L2R to infer exact decisions of both reconciliation and non-reconciliation. In the second method, N2R, the semantics of the schema is translated into an informed similarity measure, which is used in a numerical computation of the similarity of reference pairs. This similarity measure is expressed as a nonlinear equation system, which is solved using an iterative method. Experiments with the two methods on two different domains show good results for both recall and precision. The methods can be used separately or in combination. We have shown that combining them improves runtime performance.
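To make the idea of solving a nonlinear similarity system iteratively more concrete, here is a minimal sketch in Python. It is not the N2R equation system, which the abstract does not spell out. It assumes a hypothetical update rule in which the similarity of a reference pair mixes a fixed attribute-based score (`base`) with the highest current similarity among the pairs it depends on (`depends_on`); the function name, the `weight` parameter, and the example pairs are all illustrative assumptions.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]


def solve_similarities(
    base: Dict[Pair, float],
    depends_on: Dict[Pair, List[Pair]],
    weight: float = 0.5,
    eps: float = 1e-6,
    max_iter: int = 1000,
) -> Dict[Pair, float]:
    """Jacobi-style fixed-point iteration for the assumed system
    x_p = (1 - weight) * base_p + weight * max(x_q for q in depends_on[p]),
    with x_p = base_p when p depends on no other pair.
    The max makes the system nonlinear; weight < 1 makes the update a contraction."""
    x = dict(base)  # start from the attribute-based scores
    for _ in range(max_iter):
        new_x = {}
        for p, b in base.items():
            deps = depends_on.get(p, [])
            if deps:
                new_x[p] = (1 - weight) * b + weight * max(x[q] for q in deps)
            else:
                new_x[p] = b
        # stop once no similarity moved by more than eps
        delta = max(abs(new_x[p] - x[p]) for p in base)
        x = new_x
        if delta < eps:
            break
    return x


# Hypothetical data: two museum references whose similarity depends on
# the similarity of the cities they are located in.
museums: Pair = ("museum1", "museum2")
cities: Pair = ("city1", "city2")
sims = solve_similarities(
    base={museums: 0.4, cities: 0.9},
    depends_on={museums: [cities]},
)
print(sims[museums], sims[cities])
```

In this sketch the high similarity of the city pair raises the similarity of the museum pair above its attribute-only score. A pair would then be reconciled when its final similarity exceeds a chosen threshold, which is likewise an assumption here.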

Domains

Artificial Intelligence [cs.AI]

Main file

Sais-Pernelle-Rousset-JoDSXII-Camera-Ready.pdf (386.17 KB)

Origin: Files produced by the author(s)

Contributor: Fatiha Saïs

https://inria.hal.science/inria-00433007

Submitted on: Wednesday, November 18, 2009, 10:47:41

Last modified on: Thursday, April 4, 2024, 21:32:28

Long-term archiving on: Thursday, June 17, 2010, 18:49:46

Dates and versions

inria-00433007, version 1 (18-11-2009)

Identifiers

HAL Id: inria-00433007, version 1

Cite

Fatiha Saïs, Nathalie Pernelle, Marie-Christine Rousset. Combining a Logical and a Numerical Method for Data Reconciliation. Journal on Data Semantics, 2009, 12 (12), pp.66-94. ⟨inria-00433007⟩

Collections

EC-PARIS UGA CNRS INRIA LIG LIG_TDCGE LIG_TDCGE_HADAS UMR8623 INRIA2 UNIV-PARIS-SACLAY LIG_SIDCH

315 Views

391 Downloads