BMLGNov 22, 2023

Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure

arXiv:2311.13466v26 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses the challenge of slow inference speeds in molecular diffusion models for structural biology and drug design, representing an incremental improvement.

The paper tackled the problem of slow inference in molecular diffusion models by introducing a novel GNN-based architecture for learning latent representations of protein structure, achieving comparable performance to all-atom models with a 3-fold reduction in inference time.

Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models rely on coarse-grained representations of protein structure to make training and inference feasible. However, such coarse-grained representations discard essential information for modeling molecular interactions and impair the quality of generated structures. In this work, we present a novel GNN-based architecture for learning latent representations of molecular structure. When trained end-to-end with a diffusion model for de novo ligand design, our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes