CLMay 24, 2025

Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation

arXiv:2505.18853v17 citationsh-index: 5Has Code
Originality Highly original
AI Analysis

This addresses the problem of discrete text generation for researchers and practitioners in natural language processing, representing an incremental improvement over prior diffusion approaches.

The paper tackles the challenge of adapting diffusion models to text generation by proposing Smoothie, a method that smooths token embeddings based on semantic similarity, which outperforms existing diffusion-based models in generation quality on sequence-to-sequence tasks.

Diffusion models have achieved state-of-the-art performance in generating images, audio, and video, but their adaptation to text remains challenging due to its discrete nature. Prior approaches either apply Gaussian diffusion in continuous latent spaces, which inherits semantic structure but struggles with token decoding, or operate in categorical simplex space, which respect discreteness but disregard semantic relation between tokens. In this paper, we propose Smoothing Diffusion on Token Embeddings (Smoothie), a novel diffusion method that combines the strengths of both approaches by progressively smoothing token embeddings based on semantic similarity. This technique enables gradual information removal while maintaining a natural decoding process. Experimental results on several sequence-to-sequence generation tasks demonstrate that Smoothie outperforms existing diffusion-based models in generation quality. Furthermore, ablation studies show that our proposed diffusion space yields better performance than both the standard embedding space and the categorical simplex. Our code is available at https://github.com/ashaba1in/smoothie.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes