CVFeb 27, 2025

Twofold Debiasing Enhances Fine-Grained Learning with Coarse Labels

Xin-yang Zhao, Jian Jin, Yang-yang Li, Yazhou Yao

arXiv:2502.19816v16.21 citationsh-index: 2Has CodeAAAI

Originality Incremental advance

AI Analysis

This work solves a domain-specific problem in fine-grained recognition for computer vision, with incremental improvements over existing methods.

The paper tackled the Coarse-to-Fine Few-Shot task by addressing challenges in fine-grained feature extraction and overfitting from biased distributions, achieving state-of-the-art results on five benchmark datasets.

The Coarse-to-Fine Few-Shot (C2FS) task is designed to train models using only coarse labels, then leverages a limited number of subclass samples to achieve fine-grained recognition capabilities. This task presents two main challenges: coarse-grained supervised pre-training suppresses the extraction of critical fine-grained features for subcategory discrimination, and models suffer from overfitting due to biased distributions caused by limited fine-grained samples. In this paper, we propose the Twofold Debiasing (TFB) method, which addresses these challenges through detailed feature enhancement and distribution calibration. Specifically, we introduce a multi-layer feature fusion reconstruction module and an intermediate layer feature alignment module to combat the model's tendency to focus on simple predictive features directly related to coarse-grained supervision, while neglecting complex fine-grained level details. Furthermore, we mitigate the biased distributions learned by the fine-grained classifier using readily available coarse-grained sample embeddings enriched with fine-grained information. Extensive experiments conducted on five benchmark datasets demonstrate the efficacy of our approach, achieving state-of-the-art results that surpass competitive methods.

View on arXiv PDF Code

Similar