Web-Scale Blocking, Iterative and Progressive Entity Resolution

Abstract: Entity resolution aims to identify descriptions of the same entity within or across knowledge bases. In this work, we provide a comprehensive and cohesive overview of the key research results in the area of entity resolution. We are interested in frameworks addressing the new challenges in entity resolution posed by the Web of data, in which real-world entities are described by interlinked data rather than documents. Since such descriptions are usually partial, overlapping and sometimes evolving, entity resolution emerges as a central problem both for increasing dataset linking and for searching the Web of data for entities and their relations. We focus on Web-scale blocking, iterative and progressive solutions for entity resolution. Specifically, to reduce the required number of comparisons, blocking places similar descriptions into blocks and executes comparisons to identify matches only between descriptions within the same block. To minimize the number of missed matches, an iterative entity resolution process can exploit any intermediate results of blocking and matching, discovering new candidate description pairs for resolution. Finally, we overview works on progressive entity resolution, which attempt to discover as many matches as possible given a limited computing budget, by estimating the matching likelihood of yet unresolved descriptions based on the matches found so far.
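
The blocking step described in the abstract can be illustrated with a minimal sketch. The abstract does not name a specific blocking method, so this example assumes token blocking (one block per token shared by at least two descriptions), which is only one possible scheme. The sample descriptions, the function names, the Jaccard similarity, and the 0.5 threshold are all hypothetical choices made for illustration, not the authors' method.

```python
from collections import defaultdict
from itertools import combinations


def token_blocking(descriptions):
    """Place each description in one block per distinct token it contains."""
    blocks = defaultdict(set)
    for entity_id, text in descriptions.items():
        for token in set(text.lower().split()):
            blocks[token].add(entity_id)
    # A block holding a single description produces no comparison, so drop it.
    return {tok: ids for tok, ids in blocks.items() if len(ids) > 1}


def candidate_pairs(blocks):
    """Collect the distinct pairs that co-occur in at least one block."""
    pairs = set()
    for ids in blocks.values():
        pairs.update(combinations(sorted(ids), 2))
    return pairs


def jaccard(a, b):
    """Token-set Jaccard similarity (an assumed, illustrative match function)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb)


# Hypothetical toy descriptions; real inputs would be interlinked entity data.
descriptions = {
    "e1": "Stanley Kubrick director",
    "e2": "Kubrick Stanley film director",
    "e3": "Eiffel Tower Paris",
    "e4": "Paris Hilton",
}

blocks = token_blocking(descriptions)
pairs = candidate_pairs(blocks)

# Compare only pairs inside a common block; 0.5 is an assumed threshold.
matches = {
    (a, b) for a, b in pairs
    if jaccard(descriptions[a], descriptions[b]) >= 0.5
}
```

In this toy input, blocking leaves two candidate pairs to compare instead of the six pairs an exhaustive comparison of four descriptions would need. An iterative or progressive process, as outlined in the abstract, would go further by feeding the matches found back into candidate generation or by ordering the comparisons under a budget.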
Document type:
Conference paper
ICDE 2017 - 33rd IEEE International Conference on Data Engineering, Apr 2017, San Diego, CA, United States. IEEE, Data Engineering (ICDE), 2017 IEEE 33rd International Conference on, pp.1-4, 〈http://icde2017.sdsc.edu/〉. 〈10.1109/ICDE.2017.214〉
Cited literature [23 references]

https://hal.inria.fr/hal-01664035
Contributor: Vassilis Christophides
Submitted on: Thursday, January 18, 2018 - 08:44:12
Last modified on: Thursday, April 26, 2018 - 10:27:59
Document(s) archived on: Sunday, May 6, 2018 - 12:23:09

File

ICDE17_icdeposter_615.pdf
Files produced by the author(s)

Citation

Kostas Stefanidis, Vassilis Christophides, Vasilis Efthymiou. Web-Scale Blocking, Iterative and Progressive Entity Resolution. ICDE 2017 - 33rd IEEE International Conference on Data Engineering, Apr 2017, San Diego, CA, United States. IEEE, Data Engineering (ICDE), 2017 IEEE 33rd International Conference on, pp.1-4, 〈http://icde2017.sdsc.edu/〉. 〈10.1109/ICDE.2017.214〉. 〈hal-01664035〉

Metrics

Record views: 79

File downloads: 43