LGQMSep 25, 2025

A Genetic Algorithm for Navigating Synthesizable Molecular Spaces

arXiv:2509.20719v13 citationsh-index: 53
Originality Incremental advance
AI Analysis

This addresses the challenge of synthesizability in molecular design for researchers in chemistry and drug discovery, though it appears incremental as it builds on existing genetic algorithms with custom modifications.

The paper tackles the problem of designing synthesizable molecules by introducing SynGA, a genetic algorithm that operates over synthesis routes, and demonstrates its effectiveness on tasks like synthesizable analog search and property optimization, achieving state-of-the-art performance when coupled with a machine learning filter.

Inspired by the effectiveness of genetic algorithms and the importance of synthesizability in molecular design, we present SynGA, a simple genetic algorithm that operates directly over synthesis routes. Our method features custom crossover and mutation operators that explicitly constrain it to synthesizable molecular space. By modifying the fitness function, we demonstrate the effectiveness of SynGA on a variety of design tasks, including synthesizable analog search and sample-efficient property optimization, for both 2D and 3D objectives. Furthermore, by coupling SynGA with a machine learning-based filter that focuses the building block set, we boost SynGA to state-of-the-art performance. For property optimization, this manifests as a model-based variant SynGBO, which employs SynGA and block filtering in the inner loop of Bayesian optimization. Since SynGA is lightweight and enforces synthesizability by construction, our hope is that SynGA can not only serve as a strong standalone baseline but also as a versatile module that can be incorporated into larger synthesis-aware workflows in the future.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes