LG MLMay 30

Global Convergence of Adaptive Sensing for Principal Eigenvector Estimation

Alex Saad-Falcon, Brighton Ancelin, Justin Romberg

arXiv:2505.108827.4h-index: 1

Predicted impact top 84% in LG · last 90 daysOriginality Highly original

AI Analysis

For practitioners in signal processing and machine learning who need to estimate principal components under hardware constraints, this work provides the first convergence guarantee for adaptive compressed subspace tracking in the noisy setting.

This paper analyzes a compressed variant of Oja's algorithm for principal eigenvector estimation using only two adaptive measurements per sample, proving an expected error rate of O(λ1λ2 d^2 / (Δ^2 t)) and a matching lower bound, showing that the d^2 factor is the fundamental cost of compression. It also separates the rates for fully-observed, adaptive-compressed, and non-adaptive-compressed PCA across three powers of d.

Principal component analysis classically requires full $d$-dimensional samples, yet in various applications hardware limits acquisition to a few scalar measurements per sample. We analyze a compressed variant of Oja's algorithm for estimating the principal eigenvector of the data covariance matrix using only two adaptive measurements per sample. At each iteration, we observe one measurement along the current estimate and one in a random orthogonal direction. We prove that after $t$ iterations, the expected sine-squared error to the true eigenvector is $\mathcal{O}(λ_1λ_2 d^2 / (Δ^2 t))$, where $d$ is the ambient dimension, $λ_1, λ_2$ are the leading eigenvalues, and $Δ= λ_1 - λ_2$ is the eigengap. We complement this with a matching information-theoretic lower bound of $Ω(λ_1λ_2 d^2 / (Δ^2 t))$ -- the first for compressed eigenvector estimation -- proving that the $d^2$ factor, an additional factor of $d$ compared to the fully-observed minimax rate $Θ(λ_1λ_2 d / (Δ^2 t))$, is the fundamental cost of compression and cannot be improved. In contrast, any non-adaptive scheme with two measurements per iteration suffers $Ω(λ_2^2 d^3 / (Δ^2 t))$, an additional power of $d$. This separates fully-observed, adaptive-compressed, and non-adaptive-compressed PCA across three powers of $d$. Our analysis handles the noisy setting where the covariance has nonzero trailing eigenvalues, providing the first convergence guarantee for adaptive compressed subspace tracking beyond the noiseless case.

View on arXiv PDF

Similar