MLITLGSep 28, 2024

Generalization and Robustness of the Tilted Empirical Risk

arXiv:2409.19431v34 citationsh-index: 12
Originality Synthesis-oriented
AI Analysis

This work addresses theoretical guarantees for a risk metric in machine learning, offering incremental improvements in understanding generalization and robustness.

The paper analyzes the generalization error of the tilted empirical risk under negative tilt, providing uniform and information-theoretic bounds with a convergence rate of O(n^{-ε/(1+ε)}) for unbounded loss functions, and studies its robustness to noisy outliers and distribution shifts.

The generalization error (risk) of a supervised statistical learning algorithm quantifies its prediction ability on previously unseen data. Inspired by exponential tilting, \citet{li2020tilted} proposed the {\it tilted empirical risk} (TER) as a non-linear risk metric for machine learning applications such as classification and regression problems. In this work, we examine the generalization error of the tilted empirical risk in the robustness regime under \textit{negative tilt}. Our first contribution is to provide uniform and information-theoretic bounds on the {\it tilted generalization error}, defined as the difference between the population risk and the tilted empirical risk, under negative tilt for unbounded loss function under bounded $(1+ε)$-th moment of loss function for some $ε\in(0,1]$ with a convergence rate of $O(n^{-ε/(1+ε)})$ where $n$ is the number of training samples, revealing a novel application for TER under no distribution shift. Secondly, we study the robustness of the tilted empirical risk with respect to noisy outliers at training time and provide theoretical guarantees under distribution shift for the tilted empirical risk. We empirically corroborate our findings in simple experimental setups where we evaluate our bounds to select the value of tilt in a data-driven manner.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes