QMLGBMAug 30, 2023

RetroBridge: Modeling Retrosynthesis with Markov Bridges

arXiv:2308.16212v236 citationsh-index: 35
Originality Highly original
AI Analysis

This addresses the challenge of accurate and efficient retrosynthesis planning in chemistry, which is crucial for drug discovery and material design, representing a novel method rather than an incremental improvement.

The paper tackles single-step retrosynthesis planning by modeling it as a distribution learning problem in discrete state space, introducing the Markov Bridge Model as a generative framework that directly uses product molecules as samples, and achieves state-of-the-art results on standard benchmarks.

Retrosynthesis planning is a fundamental challenge in chemistry which aims at designing reaction pathways from commercially available starting materials to a target molecule. Each step in multi-step retrosynthesis planning requires accurate prediction of possible precursor molecules given the target molecule and confidence estimates to guide heuristic search algorithms. We model single-step retrosynthesis planning as a distribution learning problem in a discrete state space. First, we introduce the Markov Bridge Model, a generative framework aimed to approximate the dependency between two intractable discrete distributions accessible via a finite sample of coupled data points. Our framework is based on the concept of a Markov bridge, a Markov process pinned at its endpoints. Unlike diffusion-based methods, our Markov Bridge Model does not need a tractable noise distribution as a sampling proxy and directly operates on the input product molecules as samples from the intractable prior distribution. We then address the retrosynthesis planning problem with our novel framework and introduce RetroBridge, a template-free retrosynthesis modeling approach that achieves state-of-the-art results on standard evaluation benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes