BMLGMay 18, 2023

DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs

arXiv:2309.03907v175 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the need for AI-assisted tools in pharmaceutical research to accelerate drug discovery and reduce failure rates, though it is an initial attempt and incremental in combining existing methods.

The authors tackled the problem of enabling ChatGPT-like interactive question-answering on drug molecule graphs by developing DrugChat, a prototype system that integrates a graph neural network, a large language model, and an adaptor, trained on a dataset of 10,834 compounds and 143,517 question-answer pairs.

A ChatGPT-like system for drug compounds could be a game-changer in pharmaceutical research, accelerating drug discovery, enhancing our understanding of structure-activity relationships, guiding lead optimization, aiding drug repurposing, reducing the failure rate, and streamlining clinical trials. In this work, we make an initial attempt towards enabling ChatGPT-like capabilities on drug molecule graphs, by developing a prototype system DrugChat. DrugChat works in a similar way as ChatGPT. Users upload a compound molecule graph and ask various questions about this compound. DrugChat will answer these questions in a multi-turn, interactive manner. The DrugChat system consists of a graph neural network (GNN), a large language model (LLM), and an adaptor. The GNN takes a compound molecule graph as input and learns a representation for this graph. The adaptor transforms the graph representation produced by the GNN into another representation that is acceptable to the LLM. The LLM takes the compound representation transformed by the adaptor and users' questions about this compound as inputs and generates answers. All these components are trained end-to-end. To train DrugChat, we collected instruction tuning datasets which contain 10,834 drug compounds and 143,517 question-answer pairs. The code and data is available at \url{https://github.com/UCSD-AI4H/drugchat}

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes