ML LGDec 11, 2025

TPV: Parameter Perturbations Through the Lens of Test Prediction Variance

arXiv:2512.11089v3Has Code

Originality Incremental advance

AI Analysis

This provides a unified framework for analyzing generalization in deep networks, though it is incremental in linking existing observations.

The paper introduces test prediction variance (TPV) as a measure of model sensitivity to parameter perturbations, showing it converges from training to test data in overparameterized limits and correlates with test loss as a generalization predictor.

We identify test prediction variance (TPV)-- the first-order sensitivity of model outputs to parameter perturbations around a trained solution-- as a unifying quantity that links several classical observations about generalization in deep networks. TPV is a fully label-free object whose trace form separates the geometry of the trained model from the specific perturbation mechanism, allowing a broad family of parameter perturbations like SGD noise, label noise, finite-precision noise, and other post-training perturbations to be analyzed under a single framework. Theoretically, we show that TPV estimated on the training set converges to its test-set value in the overparameterized limit, providing the first result that prediction variance under local parameter perturbations can be inferred from training inputs alone, and this stability is decoupled from generalization performance. Empirically, TPV exhibits a striking stability across datasets and architectures even for extremely narrow networks. Further, TPV correlates well with test loss, serving as a training-set based predictive metric for generalization. Code available at github.com/devansharpit/TPV.

View on arXiv PDF Code

Similar