CV · LG · Jan 2, 2025

ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think

arXiv:2501.01045v4 · 11 citations · h-index: 17 · ICML
Originality: Synthesis-oriented
AI Analysis

This work addresses gradient bans in continual learning (settings where gradient information is unavailable), offering a benchmark and insights for gradient-free optimization, though its methodological contribution appears incremental.

The paper tackles catastrophic forgetting in continual learning when gradient information is unavailable, showing that forward-pass-based methods alone can mitigate forgetting.

Backpropagation provides a generalized configuration for overcoming catastrophic forgetting. Optimizers such as SGD and Adam are commonly used for weight updates in continual learning and continual pre-training. However, access to gradient information is not always feasible in practice due to black-box APIs, hardware constraints, or non-differentiable systems, a challenge we refer to as gradient bans. To bridge this gap, we introduce ZeroFlow, the first benchmark designed to evaluate gradient-free optimization algorithms for overcoming forgetting. ZeroFlow examines a suite of forward-pass-based methods across various algorithms, forgetting scenarios, and datasets. Our results show that forward passes alone can be sufficient to mitigate forgetting. We uncover novel optimization principles that highlight the potential of forward-pass-based methods in mitigating forgetting, managing task conflicts, and reducing memory demands. Additionally, we propose new enhancements that further improve forgetting resistance using only forward passes. This work provides essential tools and insights to advance the development of forward-pass-based methods for continual learning.
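
To make the idea of optimizing with forward passes alone concrete, here is a minimal sketch of one common gradient-free technique: a two-point zeroth-order gradient estimate along a random direction, plugged into an SGD-style update. It is an illustration under stated assumptions, not the benchmark's code. The abstract does not say which algorithms ZeroFlow evaluates, and the names `loss`, `zo_gradient`, `zo_sgd`, `mu`, and `lr` are hypothetical. The toy quadratic stands in for a model's forward pass.

```python
import random

def loss(w):
    # Toy quadratic objective standing in for a model's forward pass
    # (hypothetical; its minimum is at w_i = 3 for every coordinate).
    return sum((wi - 3.0) ** 2 for wi in w)

def zo_gradient(f, w, mu=1e-3, rng=random):
    # Two-point zeroth-order estimate: perturb the weights along a random
    # Gaussian direction u and use the difference of two forward
    # evaluations as the slope along u. No backpropagation is involved.
    u = [rng.gauss(0.0, 1.0) for _ in w]
    f_plus = f([wi + mu * ui for wi, ui in zip(w, u)])
    f_minus = f([wi - mu * ui for wi, ui in zip(w, u)])
    scale = (f_plus - f_minus) / (2.0 * mu)
    return [scale * ui for ui in u]

def zo_sgd(f, w, lr=0.05, steps=500, seed=0):
    # SGD-style loop in which the gradient is replaced by the estimate above.
    rng = random.Random(seed)
    for _ in range(steps):
        g = zo_gradient(f, w, rng=rng)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w

w_final = zo_sgd(loss, [0.0, 0.0, 0.0, 0.0])
```

Each step costs two forward evaluations and stores no activations for a backward pass, which is why this family of methods is often associated with lower memory demands. The estimate is noisy, so the step size usually has to shrink as the number of parameters grows.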

