Simplifying Entity Resolution on Web Data with Schema-agnostic, Non-iterative Matching - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Simplifying Entity Resolution on Web Data with Schema-agnostic, Non-iterative Matching

Résumé

Entity Resolution (ER) aims to identify different descriptions in various Knowledge Bases (KBs) that refer to the same entity. ER is challenged by the Variety, Volume and Veracity of descriptions published in the Web of Data. To address them, we propose the MinoanER framework that fulfills full automation and support of highly heterogeneous entities. MinoanER leverages a token-based similarity of entities to define a new metric that derives the similarity of neighboring entities from the most important relations, indicated only by statistics. For high efficiency, similarities are computed from a set of schema-agnostic blocks and processed in a non-iterative way that involves four threshold-free heuristics. We demonstrate that the effectiveness of MinoanER is comparable to existing ER tools over real KBs exhibiting low heterogeneity in terms of entity types and content. Yet, MinoanER outperforms state-of-the-art ER tools when matching highly heterogeneous KBs.
Fichier principal
Vignette du fichier
PID5235409.pdf (264.05 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01718040 , version 1 (27-02-2018)

Identifiants

  • HAL Id : hal-01718040 , version 1

Citer

Vasilis Efthymiou, George Papadakis, Kostas Stefanidis, Vassilis Christophides. Simplifying Entity Resolution on Web Data with Schema-agnostic, Non-iterative Matching. ICDE 2018 - 34th IEEE International Conference on Data Engineering, Apr 2018, Paris, France. pp.1-4. ⟨hal-01718040⟩

Collections

INRIA INRIA2
193 Consultations
248 Téléchargements

Partager

Gmail Facebook X LinkedIn More