LGMay 5, 2023

On the Effectiveness of Equivariant Regularization for Robust Online Continual Learning

arXiv:2305.03648v14 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient and robust learning in sequential data streams for AI systems, presenting an incremental improvement by integrating equivariant tasks with existing methods.

The paper tackles catastrophic forgetting in online continual learning by proposing CLER, which uses equivariant regularization for self-supervision, achieving improved performance over contrastive methods in benchmarks.

Humans can learn incrementally, whereas neural networks forget previously acquired information catastrophically. Continual Learning (CL) approaches seek to bridge this gap by facilitating the transfer of knowledge to both previous tasks (backward transfer) and future ones (forward transfer) during training. Recent research has shown that self-supervision can produce versatile models that can generalize well to diverse downstream tasks. However, contrastive self-supervised learning (CSSL), a popular self-supervision technique, has limited effectiveness in online CL (OCL). OCL only permits one iteration of the input dataset, and CSSL's low sample efficiency hinders its use on the input data-stream. In this work, we propose Continual Learning via Equivariant Regularization (CLER), an OCL approach that leverages equivariant tasks for self-supervision, avoiding CSSL's limitations. Our method represents the first attempt at combining equivariant knowledge with CL and can be easily integrated with existing OCL methods. Extensive ablations shed light on how equivariant pretext tasks affect the network's information flow and its impact on CL dynamics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes