LGAIMNOct 31, 2024

LLaMo: Large Language Model-based Molecular Graph Assistant

arXiv:2411.00871v128 citationsh-index: 12Has CodeNIPS
Originality Highly original
AI Analysis

This work addresses the challenge of integrating language models with molecular graphs for researchers in chemistry and drug discovery, representing a novel application rather than an incremental improvement.

The authors tackled the problem of applying large language models to the molecular domain by proposing LLaMo, an end-to-end trained large molecular graph-language model, which achieved state-of-the-art performance on tasks like molecular description generation, property prediction, and IUPAC name prediction.

Large Language Models (LLMs) have demonstrated remarkable generalization and instruction-following capabilities with instruction tuning. The advancements in LLMs and instruction tuning have led to the development of Large Vision-Language Models (LVLMs). However, the competency of the LLMs and instruction tuning have been less explored in the molecular domain. Thus, we propose LLaMo: Large Language Model-based Molecular graph assistant, which is an end-to-end trained large molecular graph-language model. To bridge the discrepancy between the language and graph modalities, we present the multi-level graph projector that transforms graph representations into graph tokens by abstracting the output representations of each GNN layer and motif representations with the cross-attention mechanism. We also introduce machine-generated molecular graph instruction data to instruction-tune the large molecular graph-language model for general-purpose molecule and language understanding. Our extensive experiments demonstrate that LLaMo shows the best performance on diverse tasks, such as molecular description generation, property prediction, and IUPAC name prediction. The code of LLaMo is available at https://github.com/mlvlab/LLaMo.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes