CVLGSep 25, 2025

An Adaptor for Triggering Semi-Supervised Learning to Out-of-Box Serve Deep Image Clustering

arXiv:2509.20976v11 citationsh-index: 36IEEE Transactions on Image Processing
Originality Incremental advance
AI Analysis

This addresses the need for flexible, out-of-box application of SSL in image clustering, though it is incremental as it builds on existing SSL-embedded methods.

The paper tackles the problem of enabling semi-supervised learning (SSL) for deep image clustering without requiring pretraining or trained models, by introducing an adaptor called ASD that uses pseudo-labeled data to trigger SSL learners, achieving performance close to SSL methods with ground-truth, such as a 1.33% accuracy gap on CIFAR-10.

Recently, some works integrate SSL techniques into deep clustering frameworks to enhance image clustering performance. However, they all need pretraining, clustering learning, or a trained clustering model as prerequisites, limiting the flexible and out-of-box application of SSL learners in the image clustering task. This work introduces ASD, an adaptor that enables the cold-start of SSL learners for deep image clustering without any prerequisites. Specifically, we first randomly sample pseudo-labeled data from all unlabeled data, and set an instance-level classifier to learn them with semantically aligned instance-level labels. With the ability of instance-level classification, we track the class transitions of predictions on unlabeled data to extract high-level similarities of instance-level classes, which can be utilized to assign cluster-level labels to pseudo-labeled data. Finally, we use the pseudo-labeled data with assigned cluster-level labels to trigger a general SSL learner trained on the unlabeled data for image clustering. We show the superior performance of ASD across various benchmarks against the latest deep image clustering approaches and very slight accuracy gaps compared to SSL methods using ground-truth, e.g., only 1.33% on CIFAR-10. Moreover, ASD can also further boost the performance of existing SSL-embedded deep image clustering methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes