LGMar 24, 2021

Scalable Pareto Front Approximation for Deep Multi-Objective Learning

arXiv:2103.13392v271 citationsHas Code
AI Analysis

This addresses the problem of inefficient Pareto front approximation in deep learning for researchers and practitioners, offering a scalable solution that is incremental over prior methods.

The paper tackles the challenge of scalable multi-objective optimization for deep neural networks by proposing a method that conditions networks on preferences augmented to the feature space and penalizes solutions to maintain a well-spread Pareto front. It achieves state-of-the-art quality with significantly faster computation, including a 7% training time overhead on the CelebA dataset with an EfficientNet network.

Multi-objective optimization (MOO) is a prevalent challenge for Deep Learning, however, there exists no scalable MOO solution for truly deep neural networks. Prior work either demand optimizing a new network for every point on the Pareto front, or induce a large overhead to the number of trainable parameters by using hyper-networks conditioned on modifiable preferences. In this paper, we propose to condition the network directly on these preferences by augmenting them to the feature space. Furthermore, we ensure a well-spread Pareto front by penalizing the solutions to maintain a small angle to the preference vector. In a series of experiments, we demonstrate that our Pareto fronts achieve state-of-the-art quality despite being computed significantly faster. Furthermore, we showcase the scalability as our method approximates the full Pareto front on the CelebA dataset with an EfficientNet network at a tiny training time overhead of 7% compared to a simple single-objective optimization. We make our code publicly available at https://github.com/ruchtem/cosmos.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes