LGApr 11

Mild Over-Parameterization Benefits Asymmetric Tensor PCA

arXiv:2604.1020835.6h-index: 3
AI Analysis

This work provides the first tractable algorithm for asymmetric tensor PCA with memory cost independent of d^{k̄}, addressing a key bottleneck in high-dimensional tensor problems.

The paper proposes a matrix-parameterized algorithm for Asymmetric Tensor PCA that achieves near-optimal sample complexity with only d² memory, overcoming the typical d^{⌈k̄/2⌉} memory requirement. The method improves sample efficiency and adapts to problem structure, achieving d^{k̄/2} sample complexity in symmetric limits.

Asymmetric Tensor PCA (ATPCA) is a prototypical model for studying the trade-offs between sample complexity, computation, and memory. Existing algorithms for this problem typically require at least $d^{\left\lceil\overline{k}/2\right\rceil}$ state memory cost to recover the signal, where $d$ is the vector dimension and $\overline{k}$ is the tensor order. We focus on the setting where $\overline{k} \geq 4$ is even and consider (stochastic) gradient descent-based algorithms under a limited memory budget, which permits only mild over-parameterization of the model. We propose a matrix-parameterized method (in $d^{2}$ state memory cost) using a novel three-phase alternating-update algorithm to address the problem and demonstrate how mild over-parameterization facilitates learning in two key aspects: (i) it improves sample efficiency, allowing our method to achieve \emph{near-optimal} $d^{\overline{k}-2}$ sample complexity in our limited memory setting; and (ii) it enhances adaptivity to problem structure, a previously unrecognized phenomenon, where the required sample size naturally decreases as consecutive vectors become more aligned, and in the symmetric limit attains $d^{\overline{k}/2}$, matching the \emph{best} known polynomial-time complexity. To our knowledge, this is the \emph{first} tractable algorithm for ATPCA with $d^{\overline{k}}$-independent memory costs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes