SYLGMar 14, 2024

Learning to optimize with convergence guarantees using nonlinear system theory

arXiv:2403.09389v214 citationsIEEE Control Systems Letters
Originality Highly original
AI Analysis

This provides a theoretical foundation for L2O methods, enabling reliable algorithm design for optimization in machine learning and control systems, though it is incremental in building on existing L2O paradigms.

The paper addresses the lack of theoretical convergence guarantees in learning-to-optimize (L2O) methods by proposing a framework based on nonlinear system theory that parametrizes all convergent algorithms for smooth non-convex functions, ensuring convergence by design while learning.

The increasing reliance on numerical methods for controlling dynamical systems and training machine learning models underscores the need to devise algorithms that dependably and efficiently navigate complex optimization landscapes. Classical gradient descent methods offer strong theoretical guarantees for convex problems; however, they demand meticulous hyperparameter tuning for non-convex ones. The emerging paradigm of learning to optimize (L2O) automates the discovery of algorithms with optimized performance leveraging learning models and data - yet, it lacks a theoretical framework to analyze convergence of the learned algorithms. In this paper, we fill this gap by harnessing nonlinear system theory. Specifically, we propose an unconstrained parametrization of all convergent algorithms for smooth non-convex objective functions. Notably, our framework is directly compatible with automatic differentiation tools, ensuring convergence by design while learning to optimize.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes