LG ML
Dec 30, 2019

Pareto Multi-Task Learning

arXiv:1912.12854v1
432 citations
Originality: Incremental advance
AI Analysis

This work addresses a practical need in multi-task learning: practitioners want flexible trade-offs among tasks rather than a single fixed compromise. It is an incremental advance, since it generalizes an existing idea.

The paper tackles the problem of conflicting tasks in multi-task learning by proposing Pareto MTL, a method that finds a set of well-distributed Pareto optimal solutions representing different trade-offs. In the reported experiments it outperforms some state-of-the-art algorithms on many multi-task learning applications.

Multi-task learning is a powerful method for solving multiple correlated tasks simultaneously. However, it is often impossible to find one single solution that optimizes all the tasks, since different tasks might conflict with each other. Recently, a novel method was proposed to find one single Pareto optimal solution with a good trade-off among different tasks by casting multi-task learning as multiobjective optimization. In this paper, we generalize this idea and propose a novel Pareto multi-task learning algorithm (Pareto MTL) to find a set of well-distributed Pareto solutions which can represent different trade-offs among different tasks. The proposed algorithm first formulates a multi-task learning problem as a multiobjective optimization problem, and then decomposes the multiobjective optimization problem into a set of constrained subproblems with different trade-off preferences. By solving these subproblems in parallel, Pareto MTL can find a set of well-representative Pareto optimal solutions with different trade-offs among all tasks. Practitioners can easily select their preferred solution from these Pareto solutions, or use different trade-off solutions for different situations. Experimental results confirm that the proposed algorithm can generate well-representative solutions and outperform some state-of-the-art algorithms on many multi-task learning applications.
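The decomposition described in the abstract can be illustrated with a small two-task sketch. This is not the authors' implementation. It assumes, as one plausible reading of "constrained subproblems with different trade-off preferences", that each subproblem is tied to a preference vector and owns the part of loss space whose loss vectors align best with that vector. The function names `preference_vectors`, `assign_subproblem`, and `pareto_front` are hypothetical and chosen for illustration only.

```python
import numpy as np


def preference_vectors(k):
    """Return k unit vectors spread evenly over the positive quadrant (two tasks).

    Assumption: preferences are sampled by angle; the abstract does not specify this.
    """
    angles = np.linspace(0.0, np.pi / 2, k)
    return np.stack([np.cos(angles), np.sin(angles)], axis=1)


def assign_subproblem(losses, prefs):
    """Index of the preference vector with the largest inner product with the loss vector.

    Assumption: a subproblem's region is the set of loss vectors that align
    best with its preference vector.
    """
    return int(np.argmax(prefs @ np.asarray(losses, dtype=float)))


def pareto_front(points):
    """Boolean mask marking the non-dominated rows (every objective is minimized)."""
    points = np.asarray(points, dtype=float)
    mask = np.ones(len(points), dtype=bool)
    for i, p in enumerate(points):
        # A row dominates p if it is no worse everywhere and strictly better somewhere.
        dominates_p = np.all(points <= p, axis=1) & np.any(points < p, axis=1)
        mask[i] = not dominates_p.any()
    return mask


prefs = preference_vectors(3)
# Hypothetical (task 1 loss, task 2 loss) pairs for four candidate models.
candidates = np.array([[1.0, 3.0], [2.0, 2.0], [3.0, 1.0], [3.0, 3.0]])

keep = pareto_front(candidates)
regions = [assign_subproblem(c, prefs) for c in candidates[keep]]
print(keep.tolist())  # [True, True, True, False]
print(regions)        # [2, 1, 0]
```

In this toy data the last candidate is dominated by the second, and the three remaining candidates fall into three different preference regions, which is the kind of spread across trade-offs that the abstract describes.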

Code Implementations: 1 repo
