AISep 18, 2025

FragmentRetro: A Quadratic Retrosynthetic Method Based on Fragmentation Algorithms

arXiv:2509.15409v1h-index: 10J Chem Theory Comput
Originality Incremental advance
AI Analysis

This addresses a key bottleneck in automated chemical synthesis planning for researchers and chemists, offering a more scalable approach, though it is incremental in focusing on fragment-based solutions rather than full pathways.

The paper tackles the exponential computational complexity of retrosynthesis in computer-aided synthesis planning by introducing FragmentRetro, a method based on fragmentation algorithms that achieves quadratic complexity and high solved rates on benchmarks like PaRoutes and USPTO-190.

Retrosynthesis, the process of deconstructing a target molecule into simpler precursors, is crucial for computer-aided synthesis planning (CASP). Widely adopted tree-search methods often suffer from exponential computational complexity. In this work, we introduce FragmentRetro, a novel retrosynthetic method that leverages fragmentation algorithms, specifically BRICS and r-BRICS, combined with stock-aware exploration and pattern fingerprint screening to achieve quadratic complexity. FragmentRetro recursively combines molecular fragments and verifies their presence in a building block set, providing sets of fragment combinations as retrosynthetic solutions. We present the first formal computational analysis of retrosynthetic methods, showing that tree search exhibits exponential complexity $O(b^h)$, DirectMultiStep scales as $O(h^6)$, and FragmentRetro achieves $O(h^2)$, where $h$ represents the number of heavy atoms in the target molecule and $b$ is the branching factor for tree search. Evaluations on PaRoutes, USPTO-190, and natural products demonstrate that FragmentRetro achieves high solved rates with competitive runtime, including cases where tree search fails. The method benefits from fingerprint screening, which significantly reduces substructure matching complexity. While FragmentRetro focuses on efficiently identifying fragment-based solutions rather than full reaction pathways, its computational advantages and ability to generate strategic starting candidates establish it as a powerful foundational component for scalable and automated synthesis planning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes