Skip to Main content Skip to Navigation
Conference papers

Web-Scale Blocking, Iterative and Progressive Entity Resolution

Abstract : Entity resolution aims to identify descriptions of the same entity within or across knowledge bases. In this work, we provide a comprehensive and cohesive overview of the key research results in the area of entity resolution. We are interested in frameworks addressing the new challenges in entity resolution posed by the Web of data in which real world entities are described by interlinked data rather than documents. Since such descriptions are usually partial, overlapping and sometimes evolving, entity resolution emerges as a central problem both to increase dataset linking, but also to search the Web of data for entities and their relations. We focus on Web-scale blocking, iterative and progressive solutions for entity resolution. Specifically, to reduce the required number of comparisons, blocking is performed to place similar descriptions into blocks and executes comparisons to identify matches only between descriptions within the same block. To minimize the number of missed matches, an iterative entity resolution process can exploit any intermediate results of blocking and matching, discovering new candidate description pairs for resolution. Finally, we overview works on progressive entity resolution, which attempt to discover as many matches as possible given limited computing budget, by estimating the matching likelihood of yet unresolved descriptions, based on the matches found so far.
Document type :
Conference papers
Complete list of metadata

Cited literature [23 references]  Display  Hide  Download
Contributor : Vassilis Christophides Connect in order to contact the contributor
Submitted on : Thursday, January 18, 2018 - 8:44:12 AM
Last modification on : Thursday, December 10, 2020 - 10:54:58 AM
Long-term archiving on: : Sunday, May 6, 2018 - 12:23:09 PM


Files produced by the author(s)



Kostas Stefanidis, Vassilis Christophides, Vasilis Efthymiou. Web-Scale Blocking, Iterative and Progressive Entity Resolution. ICDE 2017 - 33rd IEEE International Conference on Data Engineering, Apr 2017, San Diego, CA, United States. pp.1-4, ⟨10.1109/ICDE.2017.214⟩. ⟨hal-01664035⟩



Record views


Files downloads