LGCVSep 24, 2024

Patch-Based Contrastive Learning and Memory Consolidation for Online Unsupervised Continual Learning

arXiv:2409.16391v11 citationsh-index: 14Has Code
Originality Incremental advance
AI Analysis

This addresses a realistic learning paradigm for applications like terrain exploration where novelty is common, though it is incremental as it combines existing areas into a new setting.

The paper tackles the problem of Online Unsupervised Continual Learning (O-UCL), where an agent learns from a non-stationary, unlabeled data stream to identify increasing classes, and proposes PCMC, which uses patch-based contrastive learning and memory consolidation to achieve competitive performance on ImageNet and Places365 streams.

We focus on a relatively unexplored learning paradigm known as {\em Online Unsupervised Continual Learning} (O-UCL), where an agent receives a non-stationary, unlabeled data stream and progressively learns to identify an increasing number of classes. This paradigm is designed to model real-world applications where encountering novelty is the norm, such as exploring a terrain with several unknown and time-varying entities. Unlike prior work in unsupervised, continual, or online learning, O-UCL combines all three areas into a single challenging and realistic learning paradigm. In this setting, agents are frequently evaluated and must aim to maintain the best possible representation at any point of the data stream, rather than at the end of pre-specified offline tasks. The proposed approach, called \textbf{P}atch-based \textbf{C}ontrastive learning and \textbf{M}emory \textbf{C}onsolidation (PCMC), builds a compositional understanding of data by identifying and clustering patch-level features. Embeddings for these patch-level features are extracted with an encoder trained via patch-based contrastive learning. PCMC incorporates new data into its distribution while avoiding catastrophic forgetting, and it consolidates memory examples during ``sleep" periods. We evaluate PCMC's performance on streams created from the ImageNet and Places365 datasets. Additionally, we explore various versions of the PCMC algorithm and compare its performance against several existing methods and simple baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes