CV AISep 27, 2020

Beneficial Perturbation Network for designing general adaptive artificial intelligence systems

Shixian Wen, Amanda Rios, Yunhao Ge, Laurent Itti

arXiv:2009.13954v27.918 citations

Originality Incremental advance

AI Analysis

This addresses the problem of dynamic adaptability in AI systems for applications like continual learning, though it appears incremental as it builds on existing perturbation concepts.

The paper tackles catastrophic forgetting in continual learning by introducing a biologically plausible deep neural network with task-dependent biasing units, enabling a single network to learn unlimited parallel mappings and switch between them at runtime, achieving state-of-the-art performance across tasks and domains.

The human brain is the gold standard of adaptive learning. It not only can learn and benefit from experience, but also can adapt to new situations. In contrast, deep neural networks only learn one sophisticated but fixed mapping from inputs to outputs. This limits their applicability to more dynamic situations, where input to output mapping may change with different contexts. A salient example is continual learning - learning new independent tasks sequentially without forgetting previous tasks. Continual learning of multiple tasks in artificial neural networks using gradient descent leads to catastrophic forgetting, whereby a previously learned mapping of an old task is erased when learning new mappings for new tasks. Here, we propose a new biologically plausible type of deep neural network with extra, out-of-network, task-dependent biasing units to accommodate these dynamic situations. This allows, for the first time, a single network to learn potentially unlimited parallel input to output mappings, and to switch on the fly between them at runtime. Biasing units are programmed by leveraging beneficial perturbations (opposite to well-known adversarial perturbations) for each task. Beneficial perturbations for a given task bias the network toward that task, essentially switching the network into a different mode to process that task. This largely eliminates catastrophic interference between tasks. Our approach is memory-efficient and parameter-efficient, can accommodate many tasks, and achieves state-of-the-art performance across different tasks and domains.

View on arXiv PDF

Similar