LGMLJan 2

Categorical Reparameterization with Denoising Diffusion models

arXiv:2601.00781v21 citationsh-index: 32Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of non-differentiable categorical sampling for researchers and practitioners in machine learning, offering an incremental improvement over existing gradient estimators.

The paper tackles the challenge of optimizing models with categorical variables by introducing ReDGE, a diffusion-based soft reparameterization method, which matches or outperforms existing gradient-based methods in experiments on latent variable models and inference-time reward guidance.

Learning models with categorical variables requires optimizing expectations over discrete distributions, a setting in which stochastic gradient-based optimization is challenging due to the non-differentiability of categorical sampling. A common workaround is to replace the discrete distribution with a continuous relaxation, yielding a smooth surrogate that admits reparameterized gradient estimates via the reparameterization trick. Building on this idea, we introduce ReDGE, a novel and efficient diffusion-based soft reparameterization method for categorical distributions. Our approach defines a flexible class of gradient estimators that includes the Straight-Through estimator as a special case. Experiments spanning latent variable models and inference-time reward guidance in discrete diffusion models demonstrate that ReDGE consistently matches or outperforms existing gradient-based methods. The code will be made available at https://github.com/samsongourevitch/redge.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes