LG OC MENov 25, 2022

Learning Large Causal Structures from Inverse Covariance Matrix via Sparse Matrix Decomposition

Shuyu Dong, Kento Uemura, Akito Fujii, Shuang Chang, Yusuke Koyanagi, Koji Maruhashi, Michèle Sebag

arXiv:2211.14221v33.31 citationsh-index: 47Has Code

Originality Incremental advance

AI Analysis

This work addresses computational challenges in causal inference for large-scale datasets, offering an efficient solution with potential applications in fields like neuroscience, though it is incremental as it builds on existing linear structural equation models.

The paper tackles the problem of learning large causal structures from observational data by proposing ICID, a method based on sparse matrix decomposition of the inverse covariance matrix, which efficiently identifies directed acyclic graphs and shows robustness to noise variance misspecification, achieving high accuracy on simulated fMRI data compared to state-of-the-art algorithms.

Learning causal structures from observational data is a fundamental problem facing important computational challenges when the number of variables is large. In the context of linear structural equation models (SEMs), this paper focuses on learning causal structures from the inverse covariance matrix. The proposed method, called ICID for Independence-preserving Decomposition from Inverse Covariance matrix, is based on continuous optimization of a matrix decomposition model that preserves the nonzero patterns of the inverse covariance matrix. Through theoretical and empirical evidences, we show that ICID efficiently identifies the sought directed acyclic graph (DAG) assuming the knowledge of noise variances. Moreover, ICID is shown empirically to be robust under bounded misspecification of noise variances in the case where the noise variances are non-equal. The proposed method enjoys a low complexity, as reflected by its time efficiency in the experiments, and also enables a novel regularization scheme that yields highly accurate solutions on the Simulated fMRI data (Smith et al., 2011) in comparison with state-of-the-art algorithms.

View on arXiv PDF Code

Similar