LGSep 29, 2022

DiGress: Discrete Denoising diffusion for graph generation

arXiv:2209.14734v4624 citationsh-index: 63
AI Analysis

This addresses the challenge of generating complex graphs for applications like drug discovery and network analysis, representing a novel method rather than an incremental improvement.

The authors tackled the problem of generating graphs with categorical node and edge attributes by introducing DiGress, a discrete denoising diffusion model, which achieved state-of-the-art performance on molecular and non-molecular datasets, including up to 3x validity improvement on a planar graph dataset and scaling to 1.3M drug-like molecules without molecule-specific representations.

This work introduces DiGress, a discrete denoising diffusion model for generating graphs with categorical node and edge attributes. Our model utilizes a discrete diffusion process that progressively edits graphs with noise, through the process of adding or removing edges and changing the categories. A graph transformer network is trained to revert this process, simplifying the problem of distribution learning over graphs into a sequence of node and edge classification tasks. We further improve sample quality by introducing a Markovian noise model that preserves the marginal distribution of node and edge types during diffusion, and by incorporating auxiliary graph-theoretic features. A procedure for conditioning the generation on graph-level features is also proposed. DiGress achieves state-of-the-art performance on molecular and non-molecular datasets, with up to 3x validity improvement on a planar graph dataset. It is also the first model to scale to the large GuacaMol dataset containing 1.3M drug-like molecules without the use of molecule-specific representations.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes