LG AI MLOct 20, 2022

Causal Structural Hypothesis Testing and Data Generation Models

Jeffrey Jiang, Omead Pooladzandi, Sunay Bhat, Gregory Pottie

arXiv:2210.11275v23.31 citationsh-index: 39Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of leveraging expert causal knowledge for improved data generation and generalization, offering a practical tool for practitioners to test and synthesize datasets, though it appears incremental in applying existing methods to causal contexts.

The paper tackles the problem of testing causal structural priors for generalization and data synthesis by proposing a novel model architecture that uses deep neural networks to approximate causal models and compares priors via hypothesis testing, demonstrating effectiveness through simulations and validation on synthetic and real-world datasets.

A vast amount of expert and domain knowledge is captured by causal structural priors, yet there has been little research on testing such priors for generalization and data synthesis purposes. We propose a novel model architecture, Causal Structural Hypothesis Testing, that can use nonparametric, structural causal knowledge and approximate a causal model's functional relationships using deep neural networks. We use these architectures for comparing structural priors, akin to hypothesis testing, using a deliberate (non-random) split of training and testing data. Extensive simulations demonstrate the effectiveness of out-of-distribution generalization error as a proxy for causal structural prior hypothesis testing and offers a statistical baseline for interpreting results. We show that the variational version of the architecture, Causal Structural Variational Hypothesis Testing can improve performance in low SNR regimes. Due to the simplicity and low parameter count of the models, practitioners can test and compare structural prior hypotheses on small dataset and use the priors with the best generalization capacity to synthesize much larger, causally-informed datasets. Finally, we validate our methods on a synthetic pendulum dataset, and show a use-case on a real-world trauma surgery ground-level falls dataset.

View on arXiv PDF Code

Similar