DS LG NA PR MLNov 13, 2024

Optimal Oblivious Subspace Embeddings with Near-optimal Sparsity

Shabarish Chenakkod, Michał Dereziński, Xiaoyu Dong

arXiv:2411.08773v25.14 citationsh-index: 3ICALP

Originality Highly original

AI Analysis

This work addresses a fundamental problem in randomized linear algebra for applications like matrix approximation and regression, representing a significant theoretical advance rather than an incremental improvement.

The paper tackles the problem of constructing oblivious subspace embeddings with optimal dimension and near-optimal sparsity, achieving a sparsity of $ ilde O(1/ε)$ non-zero entries per column, which nearly matches a long-standing conjecture and improves on a prior bound of $ ilde O(1/ε^6)$.

An oblivious subspace embedding is a random $m\times n$ matrix $Π$ such that, for any $d$-dimensional subspace, with high probability $Π$ preserves the norms of all vectors in that subspace within a $1\pmε$ factor. In this work, we give an oblivious subspace embedding with the optimal dimension $m=Θ(d/ε^2)$ that has a near-optimal sparsity of $\tilde O(1/ε)$ non-zero entries per column of $Π$. This is the first result to nearly match the conjecture of Nelson and Nguyen [FOCS 2013] in terms of the best sparsity attainable by an optimal oblivious subspace embedding, improving on a prior bound of $\tilde O(1/ε^6)$ non-zeros per column [Chenakkod et al., STOC 2024]. We further extend our approach to the non-oblivious setting, proposing a new family of Leverage Score Sparsified embeddings with Independent Columns, which yield faster runtimes for matrix approximation and regression tasks. In our analysis, we develop a new method which uses a decoupling argument together with the cumulant method for bounding the edge universality error of isotropic random matrices. To achieve near-optimal sparsity, we combine this general-purpose approach with new traces inequalities that leverage the specific structure of our subspace embedding construction.

View on arXiv PDF

Similar