LGOCJun 7, 2024

On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization

arXiv:2406.04558v17 citations
Originality Incremental advance
AI Analysis

This addresses a reliability issue in constrained optimization for machine learning practitioners, offering a general-purpose solution, though it builds on prior work.

The paper tackles the unstable oscillatory dynamics in constrained optimization for neural networks by proposing the $ν$PI algorithm, which stabilizes Lagrange multiplier updates and shows robust hyperparameter behavior in experiments.

Constrained optimization offers a powerful framework to prescribe desired behaviors in neural network models. Typically, constrained problems are solved via their min-max Lagrangian formulations, which exhibit unstable oscillatory dynamics when optimized using gradient descent-ascent. The adoption of constrained optimization techniques in the machine learning community is currently limited by the lack of reliable, general-purpose update schemes for the Lagrange multipliers. This paper proposes the $ν$PI algorithm and contributes an optimization perspective on Lagrange multiplier updates based on PI controllers, extending the work of Stooke, Achiam and Abbeel (2020). We provide theoretical and empirical insights explaining the inability of momentum methods to address the shortcomings of gradient descent-ascent, and contrast this with the empirical success of our proposed $ν$PI controller. Moreover, we prove that $ν$PI generalizes popular momentum methods for single-objective minimization. Our experiments demonstrate that $ν$PI reliably stabilizes the multiplier dynamics and its hyperparameters enjoy robust and predictable behavior.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes