LG AI QM Jul 12, 2024

Accelerating the inference of string generation-based chemical reaction models for industrial applications

Mikhail Andronov, Natalia Andronova, Michael Wand, Jürgen Schmidhuber, Djork-Arné Clevert

arXiv:2407.09685v29.27 citations h-index: 27

Originality: Incremental advance

AI Analysis

The work addresses an inference-speed bottleneck for industrial applications in computer-aided synthesis planning, but it is incremental because it optimizes an existing method rather than introducing a new one.

The paper tackles the slow inference of template-free SMILES-to-SMILES translation models for chemical reaction prediction and retrosynthesis, and reports over 3X faster inference with no loss in accuracy.

Template-free SMILES-to-SMILES translation models for reaction prediction and single-step retrosynthesis are of interest for industrial applications in computer-aided synthesis planning systems due to their state-of-the-art accuracy. However, they suffer from slow inference speed. We present a method to accelerate inference in autoregressive SMILES generators through speculative decoding by copying query string subsequences into target strings in the right places. We apply our method to the Molecular Transformer implemented in PyTorch Lightning and achieve over 3X faster inference in reaction prediction and single-step retrosynthesis, with no loss in accuracy.
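The idea of drafting from the query string can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes greedy decoding, character-level tokens, and a toy oracle (`make_toy_model`, hypothetical) standing in for the Molecular Transformer. All function names are illustrative. Drafts are fixed-length windows of the query. Each step keeps the draft with the longest prefix that the model agrees with, then appends the model's own next token, so the output is identical to plain greedy decoding.

```python
from typing import Callable, List, Tuple

EOS = "<eos>"
NextToken = Callable[[List[str]], str]


def make_drafts(source: List[str], draft_len: int) -> List[List[str]]:
    """All contiguous windows of the query tokens, used as candidate drafts."""
    return [source[i:i + draft_len] for i in range(len(source) - draft_len + 1)]


def forward_pass(next_token: NextToken, prefix: List[str], draft: List[str]) -> List[str]:
    """Stand-in for one parallel decoder pass over prefix + draft.

    Returns the greedy prediction after prefix, prefix + draft[:1], and so on.
    A real transformer would produce all of these in a single call.
    """
    return [next_token(prefix + draft[:k]) for k in range(len(draft) + 1)]


def speculative_greedy_decode(
    next_token: NextToken, source: List[str], draft_len: int = 4, max_len: int = 200
) -> Tuple[List[str], int]:
    """Greedy decoding accelerated by drafts copied from the query string."""
    drafts = make_drafts(source, draft_len) or [[]]  # empty draft = plain greedy step
    output: List[str] = []
    passes = 0
    while len(output) < max_len:
        best: List[str] = []
        # Assumption: a real model verifies all drafts in one batched pass,
        # so the loop below is counted as a single pass.
        for draft in drafts:
            preds = forward_pass(next_token, output, draft)
            accepted = 0
            while accepted < len(draft) and draft[accepted] == preds[accepted]:
                accepted += 1
            # Accepted draft tokens plus the model's own token after them.
            candidate = draft[:accepted] + [preds[accepted]]
            if len(candidate) > len(best):
                best = candidate
        passes += 1
        for tok in best:
            if tok == EOS:
                return output, passes
            output.append(tok)
    return output, passes


def make_toy_model(target: List[str]) -> NextToken:
    """Hypothetical oracle: emits `target` token by token, then EOS."""
    def next_token(prefix: List[str]) -> str:
        n = len(prefix)
        if n < len(target) and prefix == target[:n]:
            return target[n]
        return EOS
    return next_token


# Esterification: acetic acid + ethanol -> ethyl acetate.
query = list("CC(=O)O.OCC")
product = list("CC(=O)OCC")
decoded, n_passes = speculative_greedy_decode(make_toy_model(product), query)
```

In this toy case the product shares long substrings with the reactants, so most tokens are accepted from drafts and the number of decoder passes falls below the one pass per token (plus one for the end-of-sequence token) that plain greedy decoding needs. How much of that saving carries over to a real model depends on how often the model agrees with the copied fragments.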
