OC LGSep 29, 2020

Projection-Free Adaptive Gradients for Large-Scale Optimization

Cyrille W. Combettes, Christoph Spiegel, Sebastian Pokutta

arXiv:2009.14114v311.913 citations

Originality Incremental advance

AI Analysis

This work addresses computational efficiency in constrained optimization for machine learning practitioners, representing an incremental improvement.

The paper tackled the problem of large-scale constrained optimization by improving stochastic Frank-Wolfe algorithms with adaptive gradients, resulting in enhanced convergence rates and computational advantages over state-of-the-art methods on convex and nonconvex objectives.

The complexity in large-scale optimization can lie in both handling the objective function and handling the constraint set. In this respect, stochastic Frank-Wolfe algorithms occupy a unique position as they alleviate both computational burdens, by querying only approximate first-order information from the objective and by maintaining feasibility of the iterates without using projections. In this paper, we improve the quality of their first-order information by blending in adaptive gradients. We derive convergence rates and demonstrate the computational advantage of our method over the state-of-the-art stochastic Frank-Wolfe algorithms on both convex and nonconvex objectives. The experiments further show that our method can improve the performance of adaptive gradient algorithms for constrained optimization.

View on arXiv PDF

Similar