LGAIOCJun 23, 2024

Jacobian Descent for Multi-Objective Optimization

arXiv:2406.16232v317 citations
Originality Highly original
AI Analysis

This work addresses the problem of balancing multiple conflicting objectives in optimization for researchers and practitioners, offering a novel method that improves upon existing gradient combination techniques.

The paper tackles multi-objective optimization by introducing Jacobian descent, a generalization of gradient descent that uses the Jacobian matrix to handle conflicting objectives, and demonstrates stronger convergence guarantees and promising results in image classification tasks.

Many optimization problems require balancing multiple conflicting objectives. As gradient descent is limited to single-objective optimization, we introduce its direct generalization: Jacobian descent (JD). This algorithm iteratively updates parameters using the Jacobian matrix of a vector-valued objective function, in which each row is the gradient of an individual objective. While several methods to combine gradients already exist in the literature, they are generally hindered when the objectives conflict. In contrast, we propose projecting gradients to fully resolve conflict while ensuring that they preserve an influence proportional to their norm. We prove significantly stronger convergence guarantees with this approach, supported by our empirical results. Our method also enables instance-wise risk minimization (IWRM), a novel learning paradigm in which the loss of each training example is considered a separate objective. Applied to simple image classification tasks, IWRM exhibits promising results compared to the direct minimization of the average loss. Additionally, we outline an efficient implementation of JD using the Gramian of the Jacobian matrix to reduce time and memory requirements.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes