MLLGSTFeb 5

Fast Rates for Nonstationary Weighted Risk Minimization

arXiv:2602.05742v1
Originality Incremental advance
AI Analysis

This work addresses prediction under nonstationary distributions for machine learning practitioners, offering theoretical guarantees that are incremental improvements over existing stationary analyses.

The paper tackles the problem of prediction error under distribution drift by analyzing weighted empirical risk minimization, providing a general decomposition of excess risk into learning and drift error terms with oracle inequalities. The results achieve minimax-optimal rates (up to logarithmic factors) in regression problems with linear models, basis approximations, and neural networks.

Weighted empirical risk minimization is a common approach to prediction under distribution drift. This article studies its out-of-sample prediction error under nonstationarity. We provide a general decomposition of the excess risk into a learning term and an error term associated with distribution drift, and prove oracle inequalities for the learning error under mixing conditions. The learning bound holds uniformly over arbitrary weight classes and accounts for the effective sample size induced by the weight vector, the complexity of the weight and hypothesis classes, and potential data dependence. We illustrate the applicability and sharpness of our results in (auto-) regression problems with linear models, basis approximations, and neural networks, recovering minimax-optimal rates (up to logarithmic factors) when specialized to unweighted and stationary settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes