LGAIMay 24, 2024

HINT: Hypernetwork Approach to Training Weight Interval Regions in Continual Learning

arXiv:2405.15444v42 citationsh-index: 16Inf Sci
Originality Highly original
AI Analysis

This addresses catastrophic forgetting in continual learning for AI systems, offering an incremental improvement over prior methods.

The paper tackles the challenge of managing high-dimensional weight intervals in Interval Continual Learning by introducing HINT, which uses interval arithmetic in a lower-dimensional embedding space and a hypernetwork to map to weights, resulting in faster training and state-of-the-art performance on benchmarks.

Recently, a new Continual Learning (CL) paradigm was presented to control catastrophic forgetting, called Interval Continual Learning (InterContiNet), which relies on enforcing interval constraints on the neural network parameter space. Unfortunately, InterContiNet training is challenging due to the high dimensionality of the weight space, making intervals difficult to manage. To address this issue, we introduce HINT, a technique that employs interval arithmetic within the embedding space and utilizes a hypernetwork to map these intervals to the target network parameter space. We train interval embeddings for consecutive tasks and train a hypernetwork to transform these embeddings into weights of the target network. An embedding for a given task is trained along with the hypernetwork, preserving the response of the target network for the previous task embeddings. Interval arithmetic works with a more manageable, lower-dimensional embedding space rather than directly preparing intervals in a high-dimensional weight space. Our model allows faster and more efficient training. Furthermore, HINT maintains the guarantee of not forgetting. At the end of training, we can choose one universal embedding to produce a single network dedicated to all tasks. In such a framework, hypernetwork is used only for training and, finally, we can utilize one set of weights. HINT obtains significantly better results than InterContiNet and gives SOTA results on several benchmarks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes