OCLGSYJul 17, 2021

On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems

arXiv:2107.08225v327 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient optimization for machine learning problems with nonlinear or structurally complex constraints, representing an incremental improvement over existing methods.

The paper tackles the problem of constrained optimization in large-scale settings by introducing first-order methods that avoid projections onto complex feasible sets and allow infeasible iterates, resulting in simpler algorithms suitable for machine learning applications.

We introduce a class of first-order methods for smooth constrained optimization that are based on an analogy to non-smooth dynamical systems. Two distinctive features of our approach are that (i) projections or optimizations over the entire feasible set are avoided, in stark contrast to projected gradient methods or the Frank-Wolfe method, and (ii) iterates are allowed to become infeasible, which differs from active set or feasible direction methods, where the descent motion stops as soon as a new constraint is encountered. The resulting algorithmic procedure is simple to implement even when constraints are nonlinear, and is suitable for large-scale constrained optimization problems in which the feasible set fails to have a simple structure. The key underlying idea is that constraints are expressed in terms of velocities instead of positions, which has the algorithmic consequence that optimizations over feasible sets at each iteration are replaced with optimizations over local, sparse convex approximations. In particular, this means that at each iteration only constraints that are violated are taken into account. The result is a simplified suite of algorithms and an expanded range of possible applications in machine learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes