LGSep 7, 2023

DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation

arXiv:2309.03974v1h-index: 18
Originality Incremental advance
AI Analysis

This addresses gradient estimation issues in fields like machine learning and engineering, offering incremental improvements for tasks such as VAE training and neural architecture search.

The paper tackles the problem of inaccurate gradient estimation in discrete stochastic settings by introducing DBsurf, a reinforce-based estimator that reduces discrepancy between samples and the distribution, achieving the lowest variance in a benchmark least squares problem and best results for training VAEs across datasets.

Computing gradients of an expectation with respect to the distributional parameters of a discrete distribution is a problem arising in many fields of science and engineering. Typically, this problem is tackled using Reinforce, which frames the problem of gradient estimation as a Monte Carlo simulation. Unfortunately, the Reinforce estimator is especially sensitive to discrepancies between the true probability distribution and the drawn samples, a common issue in low sampling regimes that results in inaccurate gradient estimates. In this paper, we introduce DBsurf, a reinforce-based estimator for discrete distributions that uses a novel sampling procedure to reduce the discrepancy between the samples and the actual distribution. To assess the performance of our estimator, we subject it to a diverse set of tasks. Among existing estimators, DBsurf attains the lowest variance in a least squares problem commonly used in the literature for benchmarking. Furthermore, DBsurf achieves the best results for training variational auto-encoders (VAE) across different datasets and sampling setups. Finally, we apply DBsurf to build a simple and efficient Neural Architecture Search (NAS) algorithm with state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes