Coping with Incomplete Data: Recent Advances

Marco Console, Paolo Guagliardo, Leonid Libkin, Etienne Toussaint
Abstract: Handling incomplete data in a correct manner is a notoriously hard problem in databases. Theoretical approaches rely on the computationally hard notion of certain answers, while practical solutions rely on ad hoc query evaluation techniques based on three-valued logic. Can we find a middle ground, and produce correct answers efficiently? The paper surveys results of the last few years motivated by this question. We reexamine the notion of certainty itself, and show that it is much more varied than previously thought. We identify cases when certain answers can be computed efficiently and, short of that, provide deterministic and probabilistic approximation schemes for them. We look at the role of three-valued logic as used in SQL query evaluation, and discuss the correctness of the choice, as well as the necessity of such a logic for producing query answers.
Marco Console, Paolo Guagliardo, Leonid Libkin, Etienne Toussaint. Coping with Incomplete Data: Recent Advances. SIGMOD/PODS 2020 - International Conference on Management of Data, Jun 2020, Portland / Virtual, United States. pp.33-47



