BM SOFT LGJun 3, 2023

Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI

Hunter Sturm, Jonas Teufel, Kaitlin A. Isfeld, Pascal Friederich, Rebecca L. Davis

arXiv:2306.02206v22.36 citationsh-index: 31Has Code

Originality Incremental advance

AI Analysis

This work addresses a long-standing issue in high-throughput screening for drug discovery, offering a method to accelerate lead molecule identification, though it is incremental as it builds on existing xAI approaches.

The paper tackles the problem of false positives in drug discovery caused by small colloidally aggregating molecules (SCAMs) by applying an explainable AI model called MEGAN, which identifies SCAMs and uses counterfactuals to design alternative compounds, with experimental validation showing utility in altering aggregation properties.

Herein, we present the application of MEGAN, our explainable AI (xAI) model, for the identification of small colloidally aggregating molecules (SCAMs). This work offers solutions to the long-standing problem of false positives caused by SCAMs in high throughput screening for drug discovery and demonstrates the power of xAI in the classification of molecular properties that are not chemically intuitive based on our current understanding. We leverage xAI insights and molecular counterfactuals to design alternatives to problematic compounds in drug screening libraries. Additionally, we experimentally validate the MEGAN prediction classification for one of the counterfactuals and demonstrate the utility of counterfactuals for altering the aggregation properties of a compound through minor structural modifications. The integration of this method in high-throughput screening approaches will help combat and circumvent false positives, providing better lead molecules more rapidly and thus accelerating drug discovery cycles.

View on arXiv PDF Code

Similar