LGSep 5, 2024

Reducing Bias in Deep Learning Optimization: The RSGDM Approach

arXiv:2409.15314v111 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses bias issues in optimization for deep learning practitioners, but it is incremental as it builds on existing SGDM methods.

The paper tackled bias and lag in deep learning optimizers by proposing the RSGDM algorithm, which uses differential correction to improve upon SGDM, and experiments on CIFAR datasets showed superior convergence accuracy.

Currently, widely used first-order deep learning optimizers include non-adaptive learning rate optimizers and adaptive learning rate optimizers. The former is represented by SGDM (Stochastic Gradient Descent with Momentum), while the latter is represented by Adam. Both of these methods use exponential moving averages to estimate the overall gradient. However, estimating the overall gradient using exponential moving averages is biased and has a lag. This paper proposes an RSGDM algorithm based on differential correction. Our contributions are mainly threefold: 1) Analyze the bias and lag brought by the exponential moving average in the SGDM algorithm. 2) Use the differential estimation term to correct the bias and lag in the SGDM algorithm, proposing the RSGDM algorithm. 3) Experiments on the CIFAR datasets have proven that our RSGDM algorithm is superior to the SGDM algorithm in terms of convergence accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes