LGAIBMSep 21, 2024

Mitigating Exposure Bias in Score-Based Generation of Molecular Conformations

arXiv:2409.14014v1h-index: 7Has Code
Originality Incremental advance
AI Analysis

This addresses a critical problem in computational chemistry for generating accurate and diverse molecular conformations, though it is incremental as it adapts an existing method from diffusion models.

The paper tackles exposure bias in score-based generative models for molecular conformation generation by proposing a measurement method and an Input Perturbation compensation algorithm, achieving new state-of-the-art performance on the GEOM-Drugs dataset and competitive results on GEOM-QM9.

Molecular conformation generation poses a significant challenge in the field of computational chemistry. Recently, Diffusion Probabilistic Models (DPMs) and Score-Based Generative Models (SGMs) are effectively used due to their capacity for generating accurate conformations far beyond conventional physics-based approaches. However, the discrepancy between training and inference rises a critical problem known as the exposure bias. While this issue has been extensively investigated in DPMs, the existence of exposure bias in SGMs and its effective measurement remain unsolved, which hinders the use of compensation methods for SGMs, including ConfGF and Torsional Diffusion as the representatives. In this work, we first propose a method for measuring exposure bias in SGMs used for molecular conformation generation, which confirms the significant existence of exposure bias in these models and measures its value. We design a new compensation algorithm Input Perturbation (IP), which is adapted from a method originally designed for DPMs only. Experimental results show that by introducing IP, SGM-based molecular conformation models can significantly improve both the accuracy and diversity of the generated conformations. Especially by using the IP-enhanced Torsional Diffusion model, we achieve new state-of-the-art performance on the GEOM-Drugs dataset and are on par on GEOM-QM9. We provide the code publicly at https://github.com/jia-975/torsionalDiff-ip.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes