QMLGApr 14, 2023

Symbiotic Message Passing Model for Transfer Learning between Anti-Fungal and Anti-Bacterial Domains

arXiv:2304.07017v1h-index: 15
Originality Incremental advance
AI Analysis

This work addresses the challenge of limited experimental data in drug discovery by enabling more effective transfer learning between domains, such as from bacteria to fungi, which is incremental but practical for pharmaceutical research.

The authors tackled the problem of predicting anti-fungal activity from anti-bacterial activity in drug discovery by developing the Symbiotic Message Passing Neural Network (SMPNN), which merges graph neural network models from different domains using new message passing lanes, resulting in better and less variable performance compared to standard transfer learning approaches.

Machine learning, and representation learning in particular, has the potential to facilitate drug discovery by screening billions of compounds. For example, a successful approach is representing the molecules as a graph and utilizing graph neural networks (GNN). Yet, these approaches still require experimental measurements of thousands of compounds to construct a proper training set. While in some domains it is easier to acquire experimental data, in others it might be more limited. For example, it is easier to test the compounds on bacteria than perform in-vivo experiments. Thus, a key question is how to utilize information from a large available dataset together with a small subset of compounds where both domains are measured to predict compounds' effect on the second, experimentally less available domain. Current transfer learning approaches for drug discovery, including training of pre-trained modules or meta-learning, have limited success. In this work, we develop a novel method, named Symbiotic Message Passing Neural Network (SMPNN), for merging graph-neural-network models from different domains. Using routing new message passing lanes between them, our approach resolves some of the potential conflicts between the different domains, and implicit constraints induced by the larger datasets. By collecting public data and performing additional high-throughput experiments, we demonstrate the advantage of our approach by predicting anti-fungal activity from anti-bacterial activity. We compare our method to the standard transfer learning approach and show that SMPNN provided better and less variable performances. Our approach is general and can be used to facilitate information transfer between any two domains such as different organisms, different organelles, or different environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes