OCLGSep 29, 2020

Projection-Free Adaptive Gradients for Large-Scale Optimization

arXiv:2009.14114v313 citations
Originality Incremental advance
AI Analysis

This work addresses computational efficiency in constrained optimization for machine learning practitioners, representing an incremental improvement.

The paper tackled the problem of large-scale constrained optimization by improving stochastic Frank-Wolfe algorithms with adaptive gradients, resulting in enhanced convergence rates and computational advantages over state-of-the-art methods on convex and nonconvex objectives.

The complexity in large-scale optimization can lie in both handling the objective function and handling the constraint set. In this respect, stochastic Frank-Wolfe algorithms occupy a unique position as they alleviate both computational burdens, by querying only approximate first-order information from the objective and by maintaining feasibility of the iterates without using projections. In this paper, we improve the quality of their first-order information by blending in adaptive gradients. We derive convergence rates and demonstrate the computational advantage of our method over the state-of-the-art stochastic Frank-Wolfe algorithms on both convex and nonconvex objectives. The experiments further show that our method can improve the performance of adaptive gradient algorithms for constrained optimization.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes