Finding Pathways in Reaction Networks guided by Energy Barriers using Integer Linear Programming
This work addresses the challenge of kinetically informed pathway discovery in chemistry, particularly for networks lacking kinetic annotations, though it is incremental as it builds on existing computational methods.
The authors tackled the problem of identifying synthesis pathways in chemical reaction networks by developing an integer linear programming method that incorporates energy barriers and ranks pathways based on probability, enabling flexible analysis even for networks without kinetic data. They demonstrated this on a network from 2-hydroxyethanenitrile, water, and ammonia, finding pathways to glycine and 2-hydroxyethanoic acid.
Analyzing synthesis pathways for target molecules in a chemical reaction network annotated with information on the kinetics of individual reactions is an area of active study. This work presents a computational methodology for searching for pathways in reaction networks which is based on integer linear programming and the modeling of reaction networks by directed hypergraphs. Often multiple pathways fit the given search criteria. To rank them, we develop an objective function based on physical arguments maximizing the probability of the pathway. We furthermore develop an automated pipeline to estimate the energy barriers of individual reactions in reaction networks. Combined, the methodology facilitates flexible and kinetically informed pathway investigations on large reaction networks by computational means, even for networks coming without kinetic annotation, such as those created via generative approaches for expanding molecular spaces. To demonstrate the methodology, we apply it on a chemical reaction network generated from 2-hydroxyethanenitrile, water, and ammonia, where we search for pathways to glycine and 2-hydroxyethanoic acid using the input molecules as precursors.