CVJul 16, 2024

CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning

arXiv:2407.12188v16 citationsh-index: 53Has Code
Originality Incremental advance
AI Analysis

This work addresses task confusion in continual self-supervised learning, which is an incremental improvement for scenarios with less diverse class incremental settings.

The paper tackles the task confusion problem in continual self-supervised learning by introducing CroMo-Mixup, a framework that mixes data and features across tasks to enhance negative sample diversity and facilitate cross-task learning, resulting in improved Task-ID prediction and average linear accuracy on datasets like CIFAR10, CIFAR100, and tinyImageNet.

Continual self-supervised learning (CSSL) learns a series of tasks sequentially on the unlabeled data. Two main challenges of continual learning are catastrophic forgetting and task confusion. While CSSL problem has been studied to address the catastrophic forgetting challenge, little work has been done to address the task confusion aspect. In this work, we show through extensive experiments that self-supervised learning (SSL) can make CSSL more susceptible to the task confusion problem, particularly in less diverse settings of class incremental learning because different classes belonging to different tasks are not trained concurrently. Motivated by this challenge, we present a novel cross-model feature Mixup (CroMo-Mixup) framework that addresses this issue through two key components: 1) Cross-Task data Mixup, which mixes samples across tasks to enhance negative sample diversity; and 2) Cross-Model feature Mixup, which learns similarities between embeddings obtained from current and old models of the mixed sample and the original images, facilitating cross-task class contrast learning and old knowledge retrieval. We evaluate the effectiveness of CroMo-Mixup to improve both Task-ID prediction and average linear accuracy across all tasks on three datasets, CIFAR10, CIFAR100, and tinyImageNet under different class-incremental learning settings. We validate the compatibility of CroMo-Mixup on four state-of-the-art SSL objectives. Code is available at \url{https://github.com/ErumMushtaq/CroMo-Mixup}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes