CVLGMar 6, 2025

FUSE: First-Order and Second-Order Unified SynthEsis in Stochastic Optimization

arXiv:2503.04204v11 citationsh-index: 16CAI
Originality Incremental advance
AI Analysis

This work addresses the computational inefficiency of second-order methods in large-scale machine learning, offering a practical solution for optimization tasks, though it appears incremental in nature.

The paper tackles the challenge of combining first-order and second-order methods in stochastic optimization by introducing FUSE-PV, a unified framework that switches between them based on criteria, achieving smaller computational complexity than SGD and Adam.

Stochastic optimization methods have actively been playing a critical role in modern machine learning algorithms to deliver decent performance. While numerous works have proposed and developed diverse approaches, first-order and second-order methods are in entirely different situations. The former is significantly pivotal and dominating in emerging deep learning but only leads convergence to a stationary point. However, second-order methods are less popular due to their computational intensity in large-dimensional problems. This paper presents a novel method that leverages both the first-order and second-order methods in a unified algorithmic framework, termed FUSE, from which a practical version (PV) is derived accordingly. FUSE-PV stands as a simple yet efficient optimization method involving a switch-over between first and second orders. Additionally, we develop different criteria that determine when to switch. FUSE-PV has provably shown a smaller computational complexity than SGD and Adam. To validate our proposed scheme, we present an ablation study on several simple test functions and show a comparison with baselines for benchmark datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes