LGAICVMLJul 6, 2024

DMTG: One-Shot Differentiable Multi-Task Grouping

arXiv:2407.05082v13 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of efficiently handling large-scale multi-task learning for AI applications, though it is incremental as it builds on existing multi-task grouping methods.

The paper tackles the problem of Multi-Task Learning with many tasks by proposing a one-shot method to simultaneously group tasks and train model weights, improving efficiency and mitigating bias compared to sequential approaches, with experiments on CelebA and Taskonomy datasets showing promising performance.

We aim to address Multi-Task Learning (MTL) with a large number of tasks by Multi-Task Grouping (MTG). Given N tasks, we propose to simultaneously identify the best task groups from 2^N candidates and train the model weights simultaneously in one-shot, with the high-order task-affinity fully exploited. This is distinct from the pioneering methods which sequentially identify the groups and train the model weights, where the group identification often relies on heuristics. As a result, our method not only improves the training efficiency, but also mitigates the objective bias introduced by the sequential procedures that potentially lead to a suboptimal solution. Specifically, we formulate MTG as a fully differentiable pruning problem on an adaptive network architecture determined by an underlying Categorical distribution. To categorize N tasks into K groups (represented by K encoder branches), we initially set up KN task heads, where each branch connects to all N task heads to exploit the high-order task-affinity. Then, we gradually prune the KN heads down to N by learning a relaxed differentiable Categorical distribution, ensuring that each task is exclusively and uniquely categorized into only one branch. Extensive experiments on CelebA and Taskonomy datasets with detailed ablations show the promising performance and efficiency of our method. The codes are available at https://github.com/ethanygao/DMTG.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes