LGCVFeb 5, 2025

Deep Clustering via Probabilistic Ratio-Cut Optimization

arXiv:2502.03405v23 citationsh-index: 23AISTATS
Originality Highly original
AI Analysis

This provides a novel clustering method that can leverage self-supervised representations for practitioners in unsupervised learning.

The paper tackles the graph clustering problem by proposing a probabilistic approach (PRCut) that models binary assignments as random variables and optimizes an upper bound on the expected ratio-cut. The method outperforms existing approaches including Rayleigh quotient relaxation and achieves competitive performance comparable to supervised classifiers when label-based similarities are provided.

We propose a novel approach for optimizing the graph ratio-cut by modeling the binary assignments as random variables. We provide an upper bound on the expected ratio-cut, as well as an unbiased estimate of its gradient, to learn the parameters of the assignment variables in an online setting. The clustering resulting from our probabilistic approach (PRCut) outperforms the Rayleigh quotient relaxation of the combinatorial problem, its online learning extensions, and several widely used methods. We demonstrate that the PRCut clustering closely aligns with the similarity measure and can perform as well as a supervised classifier when label-based similarities are provided. This novel approach can leverage out-of-the-box self-supervised representations to achieve competitive performance and serve as an evaluation method for the quality of these representations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes