STDSLGMLOct 25, 2022

Gaussian Mean Testing Made Simple

CMU
arXiv:2210.13706v16 citationsh-index: 48
Originality Synthesis-oriented
AI Analysis

This work simplifies a fundamental hypothesis testing problem in statistics, making it more accessible, though it is incremental as it builds on prior optimal results.

The paper tackles the problem of distinguishing between a standard Gaussian and a shifted Gaussian with unknown covariance, providing an extremely simple algorithm with a one-page analysis that achieves the optimal sample complexity of Θ(√d/ε²) and runs in linear time.

We study the following fundamental hypothesis testing problem, which we term Gaussian mean testing. Given i.i.d. samples from a distribution $p$ on $\mathbb{R}^d$, the task is to distinguish, with high probability, between the following cases: (i) $p$ is the standard Gaussian distribution, $\mathcal{N}(0,I_d)$, and (ii) $p$ is a Gaussian $\mathcal{N}(μ,Σ)$ for some unknown covariance $Σ$ and mean $μ\in \mathbb{R}^d$ satisfying $\|μ\|_2 \geq ε$. Recent work gave an algorithm for this testing problem with the optimal sample complexity of $Θ(\sqrt{d}/ε^2)$. Both the previous algorithm and its analysis are quite complicated. Here we give an extremely simple algorithm for Gaussian mean testing with a one-page analysis. Our algorithm is sample optimal and runs in sample linear time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes