CV LGJul 3, 2020

Eliminating Catastrophic Interference with Biased Competition

Amelia Elizabeth Pollard, Jonathan L. Shapiro

arXiv:2007.02833v11.2

Originality Incremental advance

AI Analysis

This addresses the problem of catastrophic interference in multi-task learning for AI researchers, offering an incremental improvement by applying a neuroscience-inspired method to existing datasets.

The paper tackles catastrophic interference in multi-task learning by introducing a model based on biased competition from neuroscience, which learns to separate tasks end-to-end without extra labeling. It demonstrates elimination of interference on a new toy dataset and achieves competitive results on Visual Question Answering benchmarks like COCO-QA and DAQUAR-FULL.

We present here a model to take advantage of the multi-task nature of complex datasets by learning to separate tasks and subtasks in and end to end manner by biasing competitive interactions in the network. This method does not require additional labelling or reformatting of data in a dataset. We propose an alternate view to the monolithic one-task-fits-all learning of multi-task problems, and describe a model based on a theory of neuronal attention from neuroscience, proposed by Desimone. We create and exhibit a new toy dataset, based on the MNIST dataset, which we call MNIST-QA, for testing Visual Question Answering architectures in a low-dimensional environment while preserving the more difficult components of the Visual Question Answering task, and demonstrate the proposed network architecture on this new dataset, as well as on COCO-QA and DAQUAR-FULL. We then demonstrate that this model eliminates catastrophic interference between tasks on a newly created toy dataset and provides competitive results in the Visual Question Answering space. We provide further evidence that Visual Question Answering can be approached as a multi-task problem, and demonstrate that this new architecture based on the Biased Competition model is capable of learning to separate and learn the tasks in an end-to-end fashion without the need for task labels.

View on arXiv PDF

Similar