Neural Network Parameter-optimization of Gaussian pmDAGs
This work addresses causal inference challenges for researchers by providing a novel graphical structure and algorithm, though it appears incremental in extending duality concepts to Gaussian distributions.
The authors tackled the problem of parameter optimization in latent variable causal models by establishing a duality between training a feed-forward neural network and optimizing parameters of Gaussian Bayesian networks, leading to an algorithm for parameter optimization and conditions for causal effect identifiability in Gaussian settings.
Finding the parameters of a latent variable causal model is central to causal inference and causal identification. In this article, we show that existing graphical structures that are used in causal inference are not stable under marginalization of Gaussian Bayesian networks, and present a graphical structure that faithfully represent margins of Gaussian Bayesian networks. We present the first duality between parameter optimization of a latent variable model and training a feed-forward neural network in the parameter space of the assumed family of distributions. Based on this observation, we develop an algorithm for parameter optimization of these graphical structures based on a given observational distribution. Then, we provide conditions for causal effect identifiability in the Gaussian setting. We propose an meta-algorithm that checks whether a causal effect is identifiable or not. Moreover, we lay a grounding for generalizing the duality between a neural network and a causal model from the Gaussian to other distributions.