LGMLFeb 6, 2024

Unified Discrete Diffusion for Categorical Data

arXiv:2402.03701v29 citationsh-index: 48Has Code
AI Analysis

This work addresses a bottleneck in discrete diffusion models for categorical data like language and graphs, offering a more efficient and unified approach.

The authors tackled the challenge of training and sampling in continuous-time discrete diffusion models by introducing mathematical simplifications for the variational lower bound and a unified formulation for backward denoising, resulting in USD3 outperforming all state-of-the-art baselines on established datasets.

Discrete diffusion models have seen a surge of attention with applications on naturally discrete data such as language and graphs. Although discrete-time discrete diffusion has been established for a while, only recently Campbell et al. (2022) introduced the first framework for continuous-time discrete diffusion. However, their training and sampling processes differ significantly from the discrete-time version, necessitating nontrivial approximations for tractability. In this paper, we first present a series of mathematical simplifications of the variational lower bound that enable more accurate and easy-to-optimize training for discrete diffusion. In addition, we derive a simple formulation for backward denoising that enables exact and accelerated sampling, and importantly, an elegant unification of discrete-time and continuous-time discrete diffusion. Thanks to simpler analytical formulations, both forward and now also backward probabilities can flexibly accommodate any noise distribution, including different noise distributions for multi-element objects. Experiments show that our proposed USD3 (for Unified Simplified Discrete Denoising Diffusion) outperform all SOTA baselines on established datasets. We open-source our unified code at https://github.com/LingxiaoShawn/USD3.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes