LG NE MN MLMar 29, 2020

Learning Latent Causal Structures with a Redundant Input Neural Network

Jonathan D. Young, Bryan Andrews, Gregory F. Cooper, Xinghua Lu

arXiv:2003.13135v34.210 citations

Originality Incremental advance

AI Analysis

This addresses the open problem of latent causal discovery for researchers in causal inference and machine learning, but it is incremental as it builds on known input-output causal settings with simulation-based validation.

The paper tackled the problem of learning causal structures among latent variables from high-dimensional data where inputs cause outputs, and developed a redundant input neural network (RINN) that successfully recovered latent causal structures in simulation experiments.

Most causal discovery algorithms find causal structure among a set of observed variables. Learning the causal structure among latent variables remains an important open problem, particularly when using high-dimensional data. In this paper, we address a problem for which it is known that inputs cause outputs, and these causal relationships are encoded by a causal network among a set of an unknown number of latent variables. We developed a deep learning model, which we call a redundant input neural network (RINN), with a modified architecture and a regularized objective function to find causal relationships between input, hidden, and output variables. More specifically, our model allows input variables to directly interact with all latent variables in a neural network to influence what information the latent variables should encode in order to generate the output variables accurately. In this setting, the direct connections between input and latent variables makes the latent variables partially interpretable; furthermore, the connectivity among the latent variables in the neural network serves to model their potential causal relationships to each other and to the output variables. A series of simulation experiments provide support that the RINN method can successfully recover latent causal structure between input and output variables.

View on arXiv PDF

Similar