$α$VIL: Learning to Leverage Auxiliary Tasks for Multitask Learning
This addresses the challenge of optimizing task weighting in multitask learning for researchers and practitioners, though it is incremental as it builds on existing weighting methods.
The paper tackles the problem of estimating task weights in multitask learning to improve a target task's performance by leveraging auxiliary tasks, proposing αVIL which dynamically adjusts weights using model updates and outperforms other methods in experiments.
Multitask Learning is a Machine Learning paradigm that aims to train a range of (usually related) tasks with the help of a shared model. While the goal is often to improve the joint performance of all training tasks, another approach is to focus on the performance of a specific target task, while treating the remaining ones as auxiliary data from which to possibly leverage positive transfer towards the target during training. In such settings, it becomes important to estimate the positive or negative influence auxiliary tasks will have on the target. While many ways have been proposed to estimate task weights before or during training they typically rely on heuristics or extensive search of the weighting space. We propose a novel method called $α$-Variable Importance Learning ($α$VIL) that is able to adjust task weights dynamically during model training, by making direct use of task-specific updates of the underlying model's parameters between training epochs. Experiments indicate that $α$VIL is able to outperform other Multitask Learning approaches in a variety of settings. To our knowledge, this is the first attempt at making direct use of model updates for task weight estimation.