LG MLJan 2

Categorical Reparameterization with Denoising Diffusion models

Samson Gourevitch, Alain Durmus, Eric Moulines, Jimmy Olsson, Yazid Janati

arXiv:2601.00781v22.71 citationsh-index: 32Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of non-differentiable categorical sampling for researchers and practitioners in machine learning, offering an incremental improvement over existing gradient estimators.

The paper tackles the challenge of optimizing models with categorical variables by introducing ReDGE, a diffusion-based soft reparameterization method, which matches or outperforms existing gradient-based methods in experiments on latent variable models and inference-time reward guidance.

Learning models with categorical variables requires optimizing expectations over discrete distributions, a setting in which stochastic gradient-based optimization is challenging due to the non-differentiability of categorical sampling. A common workaround is to replace the discrete distribution with a continuous relaxation, yielding a smooth surrogate that admits reparameterized gradient estimates via the reparameterization trick. Building on this idea, we introduce ReDGE, a novel and efficient diffusion-based soft reparameterization method for categorical distributions. Our approach defines a flexible class of gradient estimators that includes the Straight-Through estimator as a special case. Experiments spanning latent variable models and inference-time reward guidance in discrete diffusion models demonstrate that ReDGE consistently matches or outperforms existing gradient-based methods. The code will be made available at https://github.com/samsongourevitch/redge.

View on arXiv PDF Code

Similar