CVFeb 12, 2023

Generalized Few-Shot Continual Learning with Contrastive Mixture of Adapters

Yawen Cui, Zitong Yu, Rizhao Cai, Xun Wang, Alex C. Kot, Li Liu

arXiv:2302.05936v16.810 citationsh-index: 62Has Code

Originality Incremental advance

AI Analysis

This work addresses a practical challenge in machine learning for systems needing to learn incrementally from limited data across varying domains, though it appears incremental by extending existing FSCL methods.

The paper tackles the problem of few-shot continual learning by introducing a Generalized FSCL protocol that includes both class- and domain-incremental scenarios with domain generalization assessment, proposing a rehearsal-free Contrastive Mixture of Adapters framework that achieves improved performance on new benchmark datasets.

The goal of Few-Shot Continual Learning (FSCL) is to incrementally learn novel tasks with limited labeled samples and preserve previous capabilities simultaneously, while current FSCL methods are all for the class-incremental purpose. Moreover, the evaluation of FSCL solutions is only the cumulative performance of all encountered tasks, but there is no work on exploring the domain generalization ability. Domain generalization is a challenging yet practical task that aims to generalize beyond training domains. In this paper, we set up a Generalized FSCL (GFSCL) protocol involving both class- and domain-incremental situations together with the domain generalization assessment. Firstly, two benchmark datasets and protocols are newly arranged, and detailed baselines are provided for this unexplored configuration. We find that common continual learning methods have poor generalization ability on unseen domains and cannot better cope with the catastrophic forgetting issue in cross-incremental tasks. In this way, we further propose a rehearsal-free framework based on Vision Transformer (ViT) named Contrastive Mixture of Adapters (CMoA). Due to different optimization targets of class increment and domain increment, the CMoA contains two parts: (1) For the class-incremental issue, the Mixture of Adapters (MoA) module is incorporated into ViT, then cosine similarity regularization and the dynamic weighting are designed to make each adapter learn specific knowledge and concentrate on particular classes. (2) For the domain-related issues and domain-invariant representation learning, we alleviate the inner-class variation by prototype-calibrated contrastive learning. The codes and protocols are available at https://github.com/yawencui/CMoA.

View on arXiv PDF Code

Similar