LGITPRMay 8, 2025

ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model

arXiv:2505.05082v32 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the problem of accurate generative modeling for discrete data distributions, such as in symbolic music, with incremental improvements over existing diffusion-based methods.

The paper tackles the challenge of generative modeling for non-negative discrete data by introducing ItDPDM, which combines exact likelihood estimation with discrete-state modeling, achieving improved likelihood and sampling performance over prior models on synthetic and real-world datasets like symbolic music and images.

Generative modeling of non-negative, discrete data, such as symbolic music, remains challenging due to two persistent limitations in existing methods. Firstly, many approaches rely on modeling continuous embeddings, which is suboptimal for inherently discrete data distributions. Secondly, most models optimize variational bounds rather than exact data likelihood, resulting in inaccurate likelihood estimates and degraded sampling quality. While recent diffusion-based models have addressed these issues separately, we tackle them jointly. In this work, we introduce the Information-Theoretic Discrete Poisson Diffusion Model (ItDPDM), inspired by photon arrival process, which combines exact likelihood estimation with fully discrete-state modeling. Central to our approach is an information-theoretic Poisson Reconstruction Loss (PRL) that has a provable exact relationship with the true data likelihood. ItDPDM achieves improved likelihood and sampling performance over prior discrete and continuous diffusion models on a variety of synthetic discrete datasets. Furthermore, on real-world datasets such as symbolic music and images, ItDPDM attains superior likelihood estimates and competitive generation quality-demonstrating a proof of concept for distribution-robust discrete generative modeling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes