LGDSNAMLJul 20, 2020

Learning the Positions in CountSketch

arXiv:2007.09890v326 citations
AI Analysis

This work addresses a bottleneck in data compression and optimization for machine learning practitioners, representing a moderate advance over previous methods that only learned values.

The authors tackled the problem of learning-based sketching algorithms by proposing the first method that optimizes both the values and positions of non-zero entries in CountSketch matrices, achieving better accuracy for low-rank approximation and applying it to k-means clustering.

We consider sketching algorithms which first quickly compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem, e.g., low rank approximation. In the learning-based sketching paradigm proposed by Indyk et al. [2019], the sketch matrix is found by choosing a random sparse matrix, e.g., the CountSketch, and then updating the values of the non-zero entries by running gradient descent on a training data set. Despite the growing body of work on this paradigm, a noticeable omission is that the locations of the non-zero entries of previous algorithms were fixed, and only their values were learned. In this work we propose the first learning algorithm that also optimizes the locations of the non-zero entries. We show this algorithm gives better accuracy for low rank approximation than previous work, and apply it to other problems such as $k$-means clustering for the first time. We show that our algorithm is provably better in the spiked covariance model and for Zipfian matrices. We also show the importance of the sketch monotonicity property for combining learned sketches. Our empirical results show the importance of optimizing not only the values of the non-zero entries but also their positions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes