LG MLMar 17, 2020

A comprehensive study on the prediction reliability of graph neural networks for virtual screening

Soojung Yang, Kyung Hoon Lee, Seongok Ryu

arXiv:2003.07611v15.87 citations

Originality Incremental advance

AI Analysis

This work addresses the need for reliable decision-making in virtual screening for drug discovery, though it is incremental as it builds on existing methods with specific optimizations.

The study tackled the problem of unreliable probabilistic predictions in graph neural networks for virtual screening, especially under sparse and imbalanced data, by proposing training guidelines that emphasize regularization and inference methods, resulting in improved success rates.

Prediction models based on deep neural networks are increasingly gaining attention for fast and accurate virtual screening systems. For decision makings in virtual screening, researchers find it useful to interpret an output of classification system as probability, since such interpretation allows them to filter out more desirable compounds. However, probabilistic interpretation cannot be correct for models that hold over-parameterization problems or inappropriate regularizations, leading to unreliable prediction and decision making. In this regard, we concern the reliability of neural prediction models on molecular properties, especially when models are trained with sparse data points and imbalanced distributions. This work aims to propose guidelines for training reliable models, we thus provide methodological details and ablation studies on the following train principles. We investigate the effects of model architectures, regularization methods, and loss functions on the prediction performance and reliability of classification results. Moreover, we evaluate prediction reliability of models on virtual screening scenario. Our result highlights that correct choice of regularization and inference methods is evidently important to achieve high success rate, especially in data imbalanced situation. All experiments were performed under a single unified model implementation to alleviate external randomness in model training and to enable precise comparison of results.

View on arXiv PDF

Similar