MLLGOct 21, 2021

Mean Nyström Embeddings for Adaptive Compressive Learning

arXiv:2110.10996v28 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of scalable machine learning for large datasets, offering an incremental improvement over existing sketching methods.

The paper tackles the problem of inefficient large-scale learning by proposing data-dependent Nyström sketches for compressive learning, showing empirically that they outperform random feature sketches in tasks like k-means clustering and Gaussian modeling for a fixed sketch size.

Compressive learning is an approach to efficient large scale learning based on sketching an entire dataset to a single mean embedding (the sketch), i.e. a vector of generalized moments. The learning task is then approximately solved as an inverse problem using an adapted parametric model. Previous works in this context have focused on sketches obtained by averaging random features, that while universal can be poorly adapted to the problem at hand. In this paper, we propose and study the idea of performing sketching based on data-dependent Nyström approximation. From a theoretical perspective we prove that the excess risk can be controlled under a geometric assumption relating the parametric model used to learn from the sketch and the covariance operator associated to the task at hand. Empirically, we show for k-means clustering and Gaussian modeling that for a fixed sketch size, Nyström sketches indeed outperform those built with random features.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes