AI LG MLDec 14, 2020

A Single Iterative Step for Anytime Causal Discovery

Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik

arXiv:2012.07513v24.11 citations

Originality Incremental advance

AI Analysis

This work provides a more efficient method for causal graph discovery, which is beneficial for researchers and practitioners working with complex observational datasets in various domains.

This paper introduces an algorithm for causal discovery from observational data, handling latent confounders and selection bias. It iteratively refines the causal graph, ensuring correctness at each step and becoming more informative over time. The algorithm significantly reduces the number of conditional independence tests and the size of condition sets compared to the FCI algorithm, both with a perfect oracle and limited data.

We present a sound and complete algorithm for recovering causal graphs from observed, non-interventional data, in the possible presence of latent confounders and selection bias. We rely on the causal Markov and faithfulness assumptions and recover the equivalence class of the underlying causal graph by performing a series of conditional independence (CI) tests between observed variables. We propose a single step that is applied iteratively, such that the independence and causal relations entailed from the resulting graph, after any iteration, is correct and becomes more informative with successive iteration. Essentially, we tie the size of the CI condition set to its distance from the tested nodes on the resulting graph. Each iteration refines the skeleton and orientation by performing CI tests having condition sets that are larger than in the preceding iteration. In an iteration, condition sets of CI tests are constructed from nodes that are within a specified search distance, and the sizes of these condition sets is equal to this search distance. The algorithm then iteratively increases the search distance along with the condition set sizes. Thus, each iteration refines a graph, that was recovered by previous iterations having smaller condition sets -- having a higher statistical power. We demonstrate that our algorithm requires significantly fewer CI tests and smaller condition sets compared to the FCI algorithm. This is evident for both recovering the true underlying graph using a perfect CI oracle, and accurately estimating the graph using limited observed data.

View on arXiv PDF

Similar