MLITLGDec 4, 2025

Learning Causality for Longitudinal Data

arXiv:2512.04980v1h-index: 2
Originality Incremental advance
AI Analysis

This work addresses causal inference challenges in longitudinal data for researchers and practitioners in fields like healthcare and economics, with incremental improvements to existing methods.

This thesis developed three methods for causal inference in high-dimensional time-varying data: CDVAE for estimating individual treatment effects with theoretical guarantees, an RNN-based framework for long-term counterfactual regression using contrastive predictive coding, and a model-agnostic interpretability layer for causal representation learning. Experiments showed CDVAE outperformed baselines and approached oracle performance, while the RNN framework achieved state-of-the-art results without transformer computational costs.

This thesis develops methods for causal inference and causal representation learning (CRL) in high-dimensional, time-varying data. The first contribution introduces the Causal Dynamic Variational Autoencoder (CDVAE), a model for estimating Individual Treatment Effects (ITEs) by capturing unobserved heterogeneity in treatment response driven by latent risk factors that affect only outcomes. CDVAE comes with theoretical guarantees on valid latent adjustment and generalization bounds for ITE error. Experiments on synthetic and real datasets show that CDVAE outperforms baselines, and that state-of-the-art models greatly improve when augmented with its latent substitutes, approaching oracle performance without access to true adjustment variables. The second contribution proposes an efficient framework for long-term counterfactual regression based on RNNs enhanced with Contrastive Predictive Coding (CPC) and InfoMax. It captures long-range dependencies under time-varying confounding while avoiding the computational cost of transformers, achieving state-of-the-art results and introducing CPC into causal inference. The third contribution advances CRL by addressing how latent causes manifest in observed variables. We introduce a model-agnostic interpretability layer based on the geometry of the decoder Jacobian. A sparse self-expression prior induces modular, possibly overlapping groups of observed features aligned with shared latent influences. We provide recovery guarantees in both disjoint and overlapping settings and show that meaningful latent-to-observed structure can be recovered without anchor features or single-parent assumptions. Scalable Jacobian-based regularization techniques are also developed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes