LG CVSep 16, 2025

iCD: A Implicit Clustering Distillation Mathod for Structural Information Mining

Xiang Xue, Yatu Ji, Qing-dao-er-ji Ren, Bao Shi, Min Lu, Nier Wu, Xufei Zhuang, Haiteng Xu, Gan-qi-qi-ge Cha

arXiv:2509.12553v14.1h-index: 6Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of interpretability in knowledge distillation for researchers and practitioners, offering a novel method that is incremental but effective in specific domains.

The paper tackles the limited interpretability in logit knowledge distillation by proposing implicit Clustering Distillation (iCD), which mines and transfers structural knowledge from logits without labels or feature alignment, achieving a peak improvement of +5.08% over the baseline in fine-grained classification tasks.

Logit Knowledge Distillation has gained substantial research interest in recent years due to its simplicity and lack of requirement for intermediate feature alignment; however, it suffers from limited interpretability in its decision-making process. To address this, we propose implicit Clustering Distillation (iCD): a simple and effective method that mines and transfers interpretable structural knowledge from logits, without requiring ground-truth labels or feature-space alignment. iCD leverages Gram matrices over decoupled local logit representations to enable student models to learn latent semantic structural patterns. Extensive experiments on benchmark datasets demonstrate the effectiveness of iCD across diverse teacher-student architectures, with particularly strong performance in fine-grained classification tasks -- achieving a peak improvement of +5.08% over the baseline. The code is available at: https://github.com/maomaochongaa/iCD.

View on arXiv PDF Code

Similar