LGApr 21

Multi-Objective Reinforcement Learning for Generating Covalent Inhibitor Candidates

arXiv:2604.200194.5
Predicted impact top 98% in LG · last 90 daysOriginality Incremental advance
AI Analysis

This provides a tool for medicinal chemists in covalent drug discovery, though it is incremental as it builds on existing RL and generative methods for a specific domain.

The paper tackled the problem of designing covalent inhibitors by optimizing multiple properties like binding affinity and selectivity, using a multi-objective reinforcement learning pipeline applied to EGFR and ACHE targets, resulting in rediscovery rates up to 0.74% and generation of novel warhead motifs not in the training data.

Rational design of covalent inhibitors requires simultaneously optimizing multiple properties, such as binding affinity, target selectivity, or electrophilic reactivity. This presents a multi-objective problem not easily addressed by screening alone. Here we present a machine learning pipeline for generating covalent inhibitor candidates using multi-objective reinforcement learning (RL), applied to two targets: epidermal growth factor receptor (EGFR) and acetylcholinesterase (ACHE). A SMILES-based pretrained LSTM serves as the generative model, optimized via policy gradient RL with Pareto crowding distance to balance competing scoring functions including synthetic accessibility, predicted covalent activity, residue affinity, and an approximated docking score. The pipeline rediscovers known covalent inhibitors at rates of up to 0.50% (EGFR) and 0.74% (ACHE) in 10,000-structure runs, with candidate structures achieving warhead-to-residue distances as short as 5.5 angstrom (EGFR) and 3.2 angstrom (ACHE) after further docking-based screening. More notably, the pipeline spontaneously generates structures bearing warhead motifs absent from the training data - including allenes, 3-oxo-$β$-sultams, and $α$-methylene-$β$-lactones - all of which have independent literature support as covalent warheads. These results suggest that RL-guided generation can explore covalent chemical space beyond its training distribution, and may be useful as a tool for medicinal chemists working on covalent drug discovery.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes