CVAIDec 10, 2025

Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model

arXiv:2512.09441v1h-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem of forgetting old classes while learning new ones in incremental learning for image classification, but it is incremental as it builds on existing VLM methods.

The paper tackles class confusion in vision-language model-based class-incremental learning by proposing a framework with cross-task representation calibration and uncertainty-guided inference, achieving superior performance on multiple datasets.

Class-incremental learning requires a learning system to continually learn knowledge of new classes and meanwhile try to preserve previously learned knowledge of old classes. As current state-of-the-art methods based on Vision-Language Models (VLMs) still suffer from the issue of differentiating classes across learning tasks. Here a novel VLM-based continual learning framework for image classification is proposed. In this framework, task-specific adapters are added to the pre-trained and frozen image encoder to learn new knowledge, and a novel cross-task representation calibration strategy based on a mixture of light-weight projectors is used to help better separate all learned classes in a unified feature space, alleviating class confusion across tasks. In addition, a novel inference strategy guided by prediction uncertainty is developed to more accurately select the most appropriate image feature for class prediction. Extensive experiments on multiple datasets under various settings demonstrate the superior performance of our method compared to existing ones.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes