MLLGEMMEJan 12, 2024

Deep Learning With DAGs

arXiv:2401.06864v18 citationsh-index: 18Has CodeSSRN
Originality Incremental advance
AI Analysis

This method addresses a bottleneck in social science research by providing a more accurate and assumption-free way to empirically evaluate causal theories, though it appears incremental as it builds on existing DAG and deep learning techniques.

The authors tackled the problem of evaluating causal theories represented as directed acyclic graphs (DAGs) without relying on arbitrary functional form assumptions, which can introduce bias, by introducing causal-graphical normalizing flows (cGNFs) that model the full joint distribution using deep neural networks, enabling flexible estimation of various causal effects.

Social science theories often postulate causal relationships among a set of variables or events. Although directed acyclic graphs (DAGs) are increasingly used to represent these theories, their full potential has not yet been realized in practice. As non-parametric causal models, DAGs require no assumptions about the functional form of the hypothesized relationships. Nevertheless, to simplify the task of empirical evaluation, researchers tend to invoke such assumptions anyway, even though they are typically arbitrary and do not reflect any theoretical content or prior knowledge. Moreover, functional form assumptions can engender bias, whenever they fail to accurately capture the complexity of the causal system under investigation. In this article, we introduce causal-graphical normalizing flows (cGNFs), a novel approach to causal inference that leverages deep neural networks to empirically evaluate theories represented as DAGs. Unlike conventional approaches, cGNFs model the full joint distribution of the data according to a DAG supplied by the analyst, without relying on stringent assumptions about functional form. In this way, the method allows for flexible, semi-parametric estimation of any causal estimand that can be identified from the DAG, including total effects, conditional effects, direct and indirect effects, and path-specific effects. We illustrate the method with a reanalysis of Blau and Duncan's (1967) model of status attainment and Zhou's (2019) model of conditional versus controlled mobility. To facilitate adoption, we provide open-source software together with a series of online tutorials for implementing cGNFs. The article concludes with a discussion of current limitations and directions for future development.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes