LGAIApr 8, 2025

DMol: A Highly Efficient and Chemical Motif-Preserving Molecule Generation Platform

arXiv:2504.06312v31 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses molecule generation for drug discovery, offering incremental improvements in efficiency and validity.

The paper tackles the problem of generating valid small molecules efficiently by introducing DMol, a graph diffusion model that improves validity by about 1.5% over the state-of-the-art DiGress, reduces diffusion steps by at least 10-fold, and halves running time.

We introduce a new graph diffusion model for small molecule generation, DMol, which outperforms the state-of-the-art DiGress model in terms of validity by roughly 1.5% across all benchmarking datasets while reducing the number of diffusion steps by at least 10-fold, and the running time to roughly one half. The performance improvements are a result of a careful change in the objective function and a graph noise scheduling approach which, at each diffusion step, allows one to only change a subset of nodes of varying size in the molecule graph. Another relevant property of the method is that it can be easily combined with junction-tree-like graph representations that arise by compressing a collection of relevant ring structures into supernodes. Unlike classical junction-tree techniques that involve VAEs and require complicated reconstruction steps, compressed DMol directly performs graph diffusion on a graph that compresses only a carefully selected set of frequent carbon rings into supernodes, which results in straightforward sample generation. This compressed DMol method offers additional validity improvements over generic DMol of roughly 2%, increases the novelty of the method, and further improves the running time due to reductions in the graph size.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes