LGMTRL-SCIJun 3, 2024

In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs

arXiv:2406.01808v11 citations
Originality Incremental advance
AI Analysis

This addresses the problem of few-shot adaptation for molecular property prediction, offering a novel approach but with incremental advancements in integrating existing methods.

The paper tackled predicting out-of-distribution materials properties by leveraging in-context learning with a compound model combining GPT-2 and geometry-aware graph neural networks, resulting in significant performance improvements over general graph neural network models on the QM9 dataset.

Large language models manifest the ability of few-shot adaptation to a sequence of provided examples. This behavior, known as in-context learning, allows for performing nontrivial machine learning tasks during inference only. In this work, we address the question: can we leverage in-context learning to predict out-of-distribution materials properties? However, this would not be possible for structure property prediction tasks unless an effective method is found to pass atomic-level geometric features to the transformer model. To address this problem, we employ a compound model in which GPT-2 acts on the output of geometry-aware graph neural networks to adapt in-context information. To demonstrate our model's capabilities, we partition the QM9 dataset into sequences of molecules that share a common substructure and use them for in-context learning. This approach significantly improves the performance of the model on out-of-distribution examples, surpassing the one of general graph neural network models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes