LGOCSep 18, 2021

An Accelerated Variance-Reduced Conditional Gradient Sliding Algorithm for First-order and Zeroth-order Optimization

arXiv:2109.08858v11 citations
Originality Incremental advance
AI Analysis

This addresses a gap in optimization methods for machine learning practitioners dealing with zeroth-order problems, though it is incremental as it extends existing conditional gradient algorithms.

The paper tackles the problem of constrained optimization in zeroth-order settings where only function values are available, proposing the ARCS algorithm that works with both first-order and zeroth-order information and achieves improved convergence results in first-order optimization, with experiments validating its superiority on real-world datasets.

The conditional gradient algorithm (also known as the Frank-Wolfe algorithm) has recently regained popularity in the machine learning community due to its projection-free property to solve constrained problems. Although many variants of the conditional gradient algorithm have been proposed to improve performance, they depend on first-order information (gradient) to optimize. Naturally, these algorithms are unable to function properly in the field of increasingly popular zeroth-order optimization, where only zeroth-order information (function value) is available. To fill in this gap, we propose a novel Accelerated variance-Reduced Conditional gradient Sliding (ARCS) algorithm for finite-sum problems, which can use either first-order or zeroth-order information to optimize. To the best of our knowledge, ARCS is the first zeroth-order conditional gradient sliding type algorithms solving convex problems in zeroth-order optimization. In first-order optimization, the convergence results of ARCS substantially outperform previous algorithms in terms of the number of gradient query oracle. Finally we validated the superiority of ARCS by experiments on real-world datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes