CVJul 7, 2025

Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery

arXiv:2507.04725v24 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses a domain-specific challenge in computer vision for improving category discovery in semi-supervised learning, representing an incremental advance over prior GCD methods.

The paper tackles the problem of inconsistent optimization and category confusion in Generalized Category Discovery (GCD), which aims to classify known and discover novel categories from unlabeled data, by proposing the NC-GCD framework that uses fixed ETF prototypes and a consistent alignment loss to achieve strong performance on benchmarks with enhanced novel category accuracy.

Generalized Category Discovery (GCD) focuses on classifying known categories while simultaneously discovering novel categories from unlabeled data. However, previous GCD methods face challenges due to inconsistent optimization objectives and category confusion. This leads to feature overlap and ultimately hinders performance on novel categories. To address these issues, we propose the Neural Collapse-inspired Generalized Category Discovery (NC-GCD) framework. By pre-assigning and fixing Equiangular Tight Frame (ETF) prototypes, our method ensures an optimal geometric structure and a consistent optimization objective for both known and novel categories. We introduce a Consistent ETF Alignment Loss that unifies supervised and unsupervised ETF alignment and enhances category separability. Additionally, a Semantic Consistency Matcher (SCM) is designed to maintain stable and consistent label assignments across clustering iterations. Our method achieves strong performance on multiple GCD benchmarks, significantly enhancing novel category accuracy and demonstrating its effectiveness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes