DBAILOApr 7

Database Querying under Missing Values Governed by Missingness Mechanisms

arXiv:2604.0652014.8h-index: 36
Predicted impact top 87% in DB · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses a foundational issue in database management for scenarios with missing data, offering a novel approach that moves beyond traditional NULL handling, though it is incremental in its application to specific query answering methods.

The paper tackles the problem of defining semantics and performing query answering on relational databases with missing values, where missingness is modeled via a Bayesian network. The result includes two query answering techniques that jointly capture probabilistic uncertainty and statistical plausibility, along with complexity results characterizing computational feasibility.

We address the problems of giving a semantics to- and doing query answering (QA) on a relational database (RDB) that has missing values (MVs). The causes for the latter are governed by a Missingness Mechanism that is modelled as a Bayesian Network, which represents a Missingness Graph (MG) and involves the DB attributes. Our approach considerable departs from the treatment of RDBs with NULL (values). The MG together with the observed DB allow to build a block-independent probabilistic DB, on which basis we propose two QA techniques that jointly capture probabilistic uncertainty and statistical plausibility of the implicit imputation of MVs. We obtain complexity results that characterize the computational feasibility of those approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes