ITLGMay 30, 2025

Bounds on the Excess Minimum Risk via Generalized Information Divergence Measures

arXiv:2505.24117v11 citationsh-index: 33Entropy
Originality Incremental advance
AI Analysis

This work provides theoretical bounds for statistical estimation problems, but it is incremental as it extends prior results by relaxing assumptions on sub-Gaussianity.

The paper tackles the problem of bounding the excess minimum risk in estimating a target vector from observed or degraded features, using generalized information divergence measures like Rényi and α-Jensen-Shannon divergences. It shows that these bounds can be tighter than mutual information-based bounds for certain parameter regimes, as demonstrated through numerical examples.

Given finite-dimensional random vectors $Y$, $X$, and $Z$ that form a Markov chain in that order (i.e., $Y \to X \to Z$), we derive upper bounds on the excess minimum risk using generalized information divergence measures. Here, $Y$ is a target vector to be estimated from an observed feature vector $X$ or its stochastically degraded version $Z$. The excess minimum risk is defined as the difference between the minimum expected loss in estimating $Y$ from $X$ and from $Z$. We present a family of bounds that generalize the mutual information based bound of Györfi et al. (2023), using the Rényi and $α$-Jensen-Shannon divergences, as well as Sibson's mutual information. Our bounds are similar to those developed by Modak et al. (2021) and Aminian et al. (2024) for the generalization error of learning algorithms. However, unlike these works, our bounds do not require the sub-Gaussian parameter to be constant and therefore apply to a broader class of joint distributions over $Y$, $X$, and $Z$. We also provide numerical examples under both constant and non-constant sub-Gaussianity assumptions, illustrating that our generalized divergence based bounds can be tighter than the one based on mutual information for certain regimes of the parameter $α$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes