LGMLFeb 6

Sample Complexity of Causal Identification with Temporal Heterogeneity

arXiv:2602.06899v1h-index: 46
Originality Incremental advance
AI Analysis

This work addresses the challenge of causal identification in non-stationary systems for researchers in causal inference, providing fundamental statistical limits, though it is incremental as it builds on existing methods by combining temporal and environmental heterogeneity.

The paper tackles the problem of uniquely recovering causal graphs from observational data by integrating temporal dynamics and multi-environment heterogeneity, showing that temporal structure can compensate for insufficient heterogeneity and that heavy-tailed noise significantly increases sample complexity compared to Gaussian cases.

Recovering a unique causal graph from observational data is an ill-posed problem because multiple generating mechanisms can lead to the same observational distribution. This problem becomes solvable only by exploiting specific structural or distributional assumptions. While recent work has separately utilized time-series dynamics or multi-environment heterogeneity to constrain this problem, we integrate both as complementary sources of heterogeneity. This integration yields unified necessary identifiability conditions and enables a rigorous analysis of the statistical limits of recovery under thin versus heavy-tailed noise. In particular, temporal structure is shown to effectively substitute for missing environmental diversity, possibly achieving identifiability even under insufficient heterogeneity. Extending this analysis to heavy-tailed (Student's t) distributions, we demonstrate that while geometric identifiability conditions remain invariant, the sample complexity diverges significantly from the Gaussian baseline. Explicit information-theoretic bounds quantify this cost of robustness, establishing the fundamental limits of covariance-based causal graph recovery methods in realistic non-stationary systems. This work shifts the focus from whether causal structure is identifiable to whether it is statistically recoverable in practice.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes