AIFeb 19

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

arXiv:2602.17602v1h-index: 4
Originality Highly original
AI Analysis

This work addresses the challenge of generating valid molecular graphs for drug discovery and materials science, representing an incremental advance over existing graph diffusion methods.

The paper tackled the problem of low chemical validity in molecular graph generation by introducing MolHIT, a framework based on hierarchical discrete diffusion models, which achieved new state-of-the-art performance on the MOSES dataset with near-perfect validity and surpassed 1D baselines across multiple metrics.

Molecular generation with diffusion models has emerged as a promising direction for AI-driven drug discovery and materials science. While graph diffusion models have been widely adopted due to the discrete nature of 2D molecular graphs, existing models suffer from low chemical validity and struggle to meet the desired properties compared to 1D modeling. In this work, we introduce MolHIT, a powerful molecular graph generation framework that overcomes long-standing performance limitations in existing methods. MolHIT is based on the Hierarchical Discrete Diffusion Model, which generalizes discrete diffusion to additional categories that encode chemical priors, and decoupled atom encoding that splits the atom types according to their chemical roles. Overall, MolHIT achieves new state-of-the-art performance on the MOSES dataset with near-perfect validity for the first time in graph diffusion, surpassing strong 1D baselines across multiple metrics. We further demonstrate strong performance in downstream tasks, including multi-property guided generation and scaffold extension.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes