LGAIMLOct 20, 2022

Independence Testing-Based Approach to Causal Discovery under Measurement Error and Linear Non-Gaussian Models

arXiv:2210.11021v118 citationsh-index: 47
Originality Incremental advance
AI Analysis

This addresses the problem of causal inference under measurement error for researchers in statistics and machine learning, offering a more scalable method than existing approaches.

The paper tackles causal discovery when observed variables are noisy measurements of unobserved target variables, proposing a Transformed Independent Noise condition that enables identification of causal structure using independence tests instead of non-scalable over-complete independent component analysis, with experimental validation on synthetic and real-world data.

Causal discovery aims to recover causal structures generating the observational data. Despite its success in certain problems, in many real-world scenarios the observed variables are not the target variables of interest, but the imperfect measures of the target variables. Causal discovery under measurement error aims to recover the causal graph among unobserved target variables from observations made with measurement error. We consider a specific formulation of the problem, where the unobserved target variables follow a linear non-Gaussian acyclic model, and the measurement process follows the random measurement error model. Existing methods on this formulation rely on non-scalable over-complete independent component analysis (OICA). In this work, we propose the Transformed Independent Noise (TIN) condition, which checks for independence between a specific linear transformation of some measured variables and certain other measured variables. By leveraging the non-Gaussianity and higher-order statistics of data, TIN is informative about the graph structure among the unobserved target variables. By utilizing TIN, the ordered group decomposition of the causal model is identifiable. In other words, we could achieve what once required OICA to achieve by only conducting independence tests. Experimental results on both synthetic and real-world data demonstrate the effectiveness and reliability of our method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes