LGNAMLMay 27, 2018

Fast K-Means Clustering with Anderson Acceleration

arXiv:1805.10638v1
Originality Incremental advance
AI Analysis

This provides a faster clustering method for data analysis applications, but it is incremental as it builds on existing acceleration techniques.

The paper tackles the problem of accelerating Lloyd's algorithm for K-Means clustering by reducing the number of iterations needed for convergence, achieving a mean decrease in computational time of over 33% and outperforming other algorithms in 106 out of 120 test cases.

We propose a novel method to accelerate Lloyd's algorithm for K-Means clustering. Unlike previous acceleration approaches that reduce computational cost per iterations or improve initialization, our approach is focused on reducing the number of iterations required for convergence. This is achieved by treating the assignment step and the update step of Lloyd's algorithm as a fixed-point iteration, and applying Anderson acceleration, a well-established technique for accelerating fixed-point solvers. Classical Anderson acceleration utilizes m previous iterates to find an accelerated iterate, and its performance on K-Means clustering can be sensitive to choice of m and the distribution of samples. We propose a new strategy to dynamically adjust the value of m, which achieves robust and consistent speedups across different problem instances. Our method complements existing acceleration techniques, and can be combined with them to achieve state-of-the-art performance. We perform extensive experiments to evaluate the performance of the proposed method, where it outperforms other algorithms in 106 out of 120 test cases, and the mean decrease ratio of computational time is more than 33%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes