LG | AI | May 8, 2024

Few-Shot Class Incremental Learning via Robust Transformer Approach

arXiv:2405.05984v1 | 5 citations | h-index: 19 | Inf Sci
Originality: Incremental advance
AI Analysis

This paper addresses a critical challenge in machine learning, namely continual learning from scarce data, though its contribution appears incremental because it builds on existing transformer architectures.

The paper tackles few-shot class-incremental learning, where a model must learn new classes from limited data while avoiding catastrophic forgetting, and demonstrates that its Robust Transformer Approach outperforms prior methods by significant margins without data augmentation.

Few-Shot Class-Incremental Learning extends the Class-Incremental Learning problem: a model must cope with data scarcity while also addressing catastrophic forgetting (CF). It remains an open problem because all recent works are built upon convolutional neural networks, which perform sub-optimally compared to transformer approaches. Our paper presents the Robust Transformer Approach (ROBUSTA), built upon the Compact Convolutional Transformer. Overfitting due to few samples is addressed with a stochastic classifier, whose weights are sampled from a distribution with mean and variance vectors, thus increasing the likelihood of correct classifications, together with a batch-norm layer that stabilizes the training process. CF is handled with delta parameters: small task-specific trainable parameters that are learned while the backbone network is kept frozen. A non-parametric approach is developed to infer the delta parameters used for the model's predictions. Prototype rectification is applied to avoid biased prototype calculations caused by data scarcity. The advantage of ROBUSTA is demonstrated through a series of experiments on benchmark problems, where it outperforms prior art by large margins without any data augmentation protocols.
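
The stochastic classifier described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract only states that classifier weights are sampled from a distribution with mean and variance vectors, so the Gaussian form, the log-variance parameterization, the cosine-similarity scoring, and every function name below (`sample_weights`, `stochastic_logits`, `predict`) are assumptions made for illustration.

```python
import math
import random


def sample_weights(mu, log_var, rng):
    """Draw one weight vector w = mu + sigma * eps, with eps ~ N(0, 1).

    Assumption: a diagonal Gaussian, with sigma = exp(0.5 * log_var).
    """
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def stochastic_logits(feature, class_mu, class_log_var, rng=None):
    """Score a feature vector against every class.

    With an rng, each class weight is sampled (training-style behaviour).
    Without one, the mean vector is used directly (deterministic scoring).
    """
    logits = []
    for mu, log_var in zip(class_mu, class_log_var):
        w = sample_weights(mu, log_var, rng) if rng is not None else mu
        logits.append(cosine(feature, w))
    return logits


def predict(feature, class_mu, class_log_var):
    """Return the index of the class whose mean weight best matches the feature."""
    logits = stochastic_logits(feature, class_mu, class_log_var)
    return max(range(len(logits)), key=logits.__getitem__)
```

In this sketch, sampling the weights during training means each class is represented by a region of weight space rather than a single point, which is one plausible reading of why the approach helps when only a few samples per class are available.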

Code Implementations: 1 repo
