CVLGNov 24, 2025

Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification

arXiv:2511.18826v11 citations
Originality Incremental advance
AI Analysis

This work addresses model compression for efficient image classification, offering incremental improvements in knowledge distillation techniques.

The paper tackled the problem of inefficient knowledge distillation by proposing an uncertainty-aware dual-student framework that uses teacher prediction uncertainty to guide learning, resulting in improved accuracy on ImageNet-100 with ResNet-18 achieving 83.84% and MobileNetV2 achieving 81.46%, representing gains of 2.04% and 0.92% over baselines.

Knowledge distillation has emerged as a powerful technique for model compression, enabling the transfer of knowledge from large teacher networks to compact student models. However, traditional knowledge distillation methods treat all teacher predictions equally, regardless of the teacher's confidence in those predictions. This paper proposes an uncertainty-aware dual-student knowledge distillation framework that leverages teacher prediction uncertainty to selectively guide student learning. We introduce a peer-learning mechanism where two heterogeneous student architectures, specifically ResNet-18 and MobileNetV2, learn collaboratively from both the teacher network and each other. Experimental results on ImageNet-100 demonstrate that our approach achieves superior performance compared to baseline knowledge distillation methods, with ResNet-18 achieving 83.84\% top-1 accuracy and MobileNetV2 achieving 81.46\% top-1 accuracy, representing improvements of 2.04\% and 0.92\% respectively over traditional single-student distillation approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes