LGAIMar 8

OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions

arXiv:2603.07431v1
Predicted impact top 98% in LG · last 90 daysOriginality Highly original
AI Analysis

This work provides a foundational shift from correlational to causal sequence modeling for researchers and practitioners, improving robustness and interpretability under distribution shift.

Transformers struggle with spurious correlations from latent confounders, leading to out-of-distribution failures. OrthoFormer addresses this by embedding instrumental variable estimation into Transformer blocks using neural control functions, achieving strictly less bias than OLS for any valid instrument lag, with residual bias decaying geometrically.

Transformer architectures excel at sequential modeling yet remain fundamentally limited by correlational learning - they capture spurious associations induced by latent confounders rather than invariant causal mechanisms. We identify this as an epistemological challenge: standard Transformers conflate static background factors (intrinsic identity, style, context) with dynamic causal flows (state evolution, mechanism), leading to catastrophic out-of-distribution failure. We propose OrthoFormer, a causally grounded architecture that embeds instrumental variable estimation directly into Transformer blocks via neural control functions. Our framework rests on four theoretical pillars: Structural Directionality (time-arrow enforcement), Representation Orthogonality (latent-noise separation), Causal Sparsity (Markov Blanket approximation), and End-to-End Consistency (gradient- detached stage separation). We prove that OrthoFormer achieves bias strictly less than OLS for any valid instrument lag, with residual bias decaying geometrically as O(\r{ho}k ). We characterize the bias-variance-exogeneity trilemma inherent in self-instrumenting and identify the neural forbidden regression - where removing gradient detachment improves prediction loss while destroying causal validity. Experiments confirm all theoretical predictions. OrthoFormer represents a paradigm shift from correlational to causal sequence modeling, with implications for robustness, interpretability, and reliable decision-making under distribution shift.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes