COMP-PHLGAug 18, 2025

Generalization vs. Memorization in Autoregressive Deep Learning: Or, Examining Temporal Decay of Gradient Coherence

arXiv:2509.00024v11 citationsh-index: 84
Originality Incremental advance
AI Analysis

This addresses a critical problem for scientific discovery using foundation models, where reliable generalization is necessary for novel insights and robust deployment.

The paper tackles the challenge of distinguishing genuine generalization from memorization in autoregressive PDE surrogate models, revealing fundamental limitations in standard models and training routines while providing actionable design insights.

Foundation models trained as autoregressive PDE surrogates hold significant promise for accelerating scientific discovery through their capacity to both extrapolate beyond training regimes and efficiently adapt to downstream tasks despite a paucity of examples for fine-tuning. However, reliably achieving genuine generalization - a necessary capability for producing novel scientific insights and robustly performing during deployment - remains a critical challenge. Establishing whether or not these requirements are met demands evaluation metrics capable of clearly distinguishing genuine model generalization from mere memorization. We apply the influence function formalism to systematically characterize how autoregressive PDE surrogates assimilate and propagate information derived from diverse physical scenarios, revealing fundamental limitations of standard models and training routines in addition to providing actionable insights regarding the design of improved surrogates.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes