Web-Scale Blocking, Iterative and Progressive Entity Resolution

Abstract : Entity resolution aims to identify descriptions of the same entity within or across knowledge bases. In this work, we provide a comprehensive and cohesive overview of the key research results in the area of entity resolution. We are interested in frameworks addressing the new challenges in entity resolution posed by the Web of data in which real world entities are described by interlinked data rather than documents. Since such descriptions are usually partial, overlapping and sometimes evolving, entity resolution emerges as a central problem both to increase dataset linking, but also to search the Web of data for entities and their relations. We focus on Web-scale blocking, iterative and progressive solutions for entity resolution. Specifically, to reduce the required number of comparisons, blocking is performed to place similar descriptions into blocks and executes comparisons to identify matches only between descriptions within the same block. To minimize the number of missed matches, an iterative entity resolution process can exploit any intermediate results of blocking and matching, discovering new candidate description pairs for resolution. Finally, we overview works on progressive entity resolution, which attempt to discover as many matches as possible given limited computing budget, by estimating the matching likelihood of yet unresolved descriptions, based on the matches found so far.
Document type :
Conference papers
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/hal-01664035
Contributor : Vassilis Christophides <>
Submitted on : Thursday, January 18, 2018 - 8:44:12 AM
Last modification on : Wednesday, May 15, 2019 - 3:35:46 AM
Long-term archiving on : Sunday, May 6, 2018 - 12:23:09 PM

File

ICDE17_icdeposter_615.pdf
Files produced by the author(s)

Identifiers

Citation

Kostas Stefanidis, Vassilis Christophides, Vasilis Efthymiou. Web-Scale Blocking, Iterative and Progressive Entity Resolution. ICDE 2017 - 33rd IEEE International Conference on Data Engineering, Apr 2017, San Diego, CA, United States. pp.1-4, ⟨10.1109/ICDE.2017.214⟩. ⟨hal-01664035⟩

Share

Metrics

Record views

134

Files downloads

126