STLGMEMLJul 26, 2025

Extreme value theory for singular subspace estimation in the matrix denoising model

arXiv:2507.19978v1h-index: 1
Originality Incremental advance
AI Analysis

This provides a statistical tool for detecting low-rank signal structures in noisy matrix data, particularly useful in applications where differences are sparse, but it is incremental relative to existing matrix denoising literature.

The paper tackles the problem of estimating singular subspaces in matrix denoising by establishing that the maximum Euclidean row norm of the difference between sample and population singular vectors follows a Gumbel distribution asymptotically, and applies this to develop a hypothesis test with higher power for structured alternatives compared to Frobenius norm-based methods.

This paper studies fine-grained singular subspace estimation in the matrix denoising model where a deterministic low-rank signal matrix is additively perturbed by a stochastic matrix of Gaussian noise. We establish that the maximum Euclidean row norm (i.e., the two-to-infinity norm) of the aligned difference between the leading sample and population singular vectors approaches the Gumbel distribution in the large-matrix limit, under suitable signal-to-noise conditions and after appropriate centering and scaling. We apply our novel asymptotic distributional theory to test hypotheses of low-rank signal structure encoded in the leading singular vectors and their corresponding principal subspace. We provide de-biased estimators for the corresponding nuisance signal singular values and show that our proposed plug-in test statistic has desirable properties. Notably, compared to using the Frobenius norm subspace distance, our test statistic based on the two-to-infinity norm has higher power to detect structured alternatives that differ from the null in only a few matrix entries or rows. Our main results are obtained by a novel synthesis of and technical analysis involving entrywise matrix perturbation analysis, extreme value theory, saddle point approximation methods, and random matrix theory. Our contributions complement the existing literature for matrix denoising focused on minimaxity, mean squared error analysis, unitarily invariant distances between subspaces, component-wise asymptotic distributional theory, and row-wise uniform error bounds. Numerical simulations illustrate our main results and demonstrate the robustness properties of our testing procedure to non-Gaussian noise distributions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes