LGITSTApr 17, 2020

Statistical Learning Guarantees for Compressive Clustering and Compressive Mixture Modeling

arXiv:2004.08085v30.0015 citations
AI Analysis45

This work addresses the need for resource-efficient large-scale unsupervised learning, offering theoretical guarantees for specific compressive methods.

The paper tackles the problem of providing statistical learning guarantees for compressive clustering and compressive Gaussian mixture modeling, establishing sufficient sketch sizes for these tasks based on problem dimensions.

We provide statistical learning guarantees for two unsupervised learning tasks in the context of compressive statistical learning, a general framework for resource-efficient large-scale learning that we introduced in a companion paper.The principle of compressive statistical learning is to compress a training collection, in one pass, into a low-dimensional sketch (a vector of random empirical generalized moments) that captures the information relevant to the considered learning task. We explicitly describe and analyze random feature functions which empirical averages preserve the needed information for compressive clustering and compressive Gaussian mixture modeling with fixed known variance, and establish sufficient sketch sizes given the problem dimensions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes