LGMLApr 26, 2018

Quantized Compressive K-Means

arXiv:1804.10109v223 citations
Originality Incremental advance
AI Analysis

This work addresses resource waste in acquisition for large-scale data clustering, offering an incremental improvement for compressive learning methods.

The paper tackles the inefficiency of digital implementations in compressive K-means by generalizing the sketching procedure to include hardware-friendly periodic nonlinearities, such as 1-bit universal quantization, resulting in minimal impact on clustering performance as shown in numerical experiments.

The recent framework of compressive statistical learning aims at designing tractable learning algorithms that use only a heavily compressed representation-or sketch-of massive datasets. Compressive K-Means (CKM) is such a method: it estimates the centroids of data clusters from pooled, non-linear, random signatures of the learning examples. While this approach significantly reduces computational time on very large datasets, its digital implementation wastes acquisition resources because the learning examples are compressed only after the sensing stage. The present work generalizes the sketching procedure initially defined in Compressive K-Means to a large class of periodic nonlinearities including hardware-friendly implementations that compressively acquire entire datasets. This idea is exemplified in a Quantized Compressive K-Means procedure, a variant of CKM that leverages 1-bit universal quantization (i.e. retaining the least significant bit of a standard uniform quantizer) as the periodic sketch nonlinearity. Trading for this resource-efficient signature (standard in most acquisition schemes) has almost no impact on the clustering performances, as illustrated by numerical experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes