LGAIMEMLMar 23

On the Number of Conditional Independence Tests in Constraint-based Causal Discovery

arXiv:2603.2184424.81 citationsh-index: 4
AI Analysis

This addresses a fundamental bottleneck in causal inference for fields like genetics and social sciences, offering a significant improvement over existing methods, though it is incremental within constraint-based approaches.

The paper tackles the problem of high computational complexity in constraint-based causal discovery by proposing an algorithm that reduces the number of conditional independence tests to $p^{\\mathcal{O}(s)}$, where $p$ is the number of nodes and $s$ is the maximum undirected clique size, and proves a lower bound of $2^{\\Omega(s)}$ tests, showing exponent-optimality up to a logarithmic factor.

Learning causal relations from observational data is a fundamental problem with wide-ranging applications across many fields. Constraint-based methods infer the underlying causal structure by performing conditional independence tests. However, existing algorithms such as the prominent PC algorithm need to perform a large number of independence tests, which in the worst case is exponential in the maximum degree of the causal graph. Despite extensive research, it remains unclear if there exist algorithms with better complexity without additional assumptions. Here, we establish an algorithm that achieves a better complexity of $p^{\mathcal{O}(s)}$ tests, where $p$ is the number of nodes in the graph and $s$ denotes the maximum undirected clique size of the underlying essential graph. Complementing this result, we prove that any constraint-based algorithm must perform at least $2^{Ω(s)}$ conditional independence tests, establishing that our proposed algorithm achieves exponent-optimality up to a logarithmic factor in terms of the number of conditional independence tests needed. Finally, we validate our theoretical findings through simulations, on semi-synthetic gene-expression data, and real-world data, demonstrating the efficiency of our algorithm compared to existing methods in terms of number of conditional independence tests needed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes