LGNAMLOct 11, 2024

Distillation of Discrete Diffusion through Dimensional Correlations

arXiv:2410.08709v429 citationsh-index: 15Has CodeICML
Originality Highly original
AI Analysis

This addresses the computational bottleneck in discrete diffusion models for generative modeling, offering a scalable solution to improve sampling efficiency.

The paper tackles the slow sampling speed of discrete diffusion models by proposing mixture models that capture dimensional correlations and distillation loss functions, enabling distillation of many-step models into just a few steps with effectiveness shown in image and language domains.

Diffusion models have demonstrated exceptional performances in various fields of generative modeling, but suffer from slow sampling speed due to their iterative nature. While this issue is being addressed in continuous domains, discrete diffusion models face unique challenges, particularly in capturing dependencies between elements (e.g., pixel relationships in image, sequential dependencies in language) mainly due to the computational cost of processing high-dimensional joint distributions. In this paper, (i) we propose "mixture" models for discrete diffusion that are capable of treating dimensional correlations while remaining scalable, and (ii) we provide a set of loss functions for distilling the iterations of existing models. Two primary theoretical insights underpin our approach: First, conventional models with element-wise independence can well approximate the data distribution, but essentially require {\it many sampling steps}. Second, our loss functions enable the mixture models to distill such many-step conventional models into just a few steps by learning the dimensional correlations. Our experimental results show the effectiveness of the proposed method in distilling pretrained discrete diffusion models across image and language domains. The code used in the paper is available at https://github.com/sony/di4c .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes