LGIRDec 5, 2024

Graph Disentangle Causal Model: Enhancing Causal Inference in Networked Observational Data

arXiv:2412.03913v15 citationsh-index: 8WSDM
Originality Incremental advance
AI Analysis

This addresses causal inference in networked data for domains like social science or healthcare, but it is incremental as it builds on existing graph-based methods by adding disentanglement.

The paper tackles the problem of estimating individual treatment effects from observational data with hidden confounders by proposing the Graph Disentangle Causal model, which separates features into adjustment and confounder representations using graph aggregation, and demonstrates effectiveness through experiments on two networked datasets.

Estimating individual treatment effects (ITE) from observational data is a critical task across various domains. However, many existing works on ITE estimation overlook the influence of hidden confounders, which remain unobserved at the individual unit level. To address this limitation, researchers have utilized graph neural networks to aggregate neighbors' features to capture the hidden confounders and mitigate confounding bias by minimizing the discrepancy of confounder representations between the treated and control groups. Despite the success of these approaches, practical scenarios often treat all features as confounders and involve substantial differences in feature distributions between the treated and control groups. Confusing the adjustment and confounder and enforcing strict balance on the confounder representations could potentially undermine the effectiveness of outcome prediction. To mitigate this issue, we propose a novel framework called the \textit{Graph Disentangle Causal model} (GDC) to conduct ITE estimation in the network setting. GDC utilizes a causal disentangle module to separate unit features into adjustment and confounder representations. Then we design a graph aggregation module consisting of three distinct graph aggregators to obtain adjustment, confounder, and counterfactual confounder representations. Finally, a causal constraint module is employed to enforce the disentangled representations as true causal factors. The effectiveness of our proposed method is demonstrated by conducting comprehensive experiments on two networked datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes