LG MTRL-SCIJun 3, 2024

In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs

Grzegorz Kaszuba, Amirhossein D. Naghdi, Dario Massa, Stefanos Papanikolaou, Andrzej Jaszkiewicz, Piotr Sankowski

arXiv:2406.01808v14.61 citationsh-index: 27

Originality Incremental advance

AI Analysis

This addresses the problem of few-shot adaptation for molecular property prediction, offering a novel approach but with incremental advancements in integrating existing methods.

The paper tackled predicting out-of-distribution materials properties by leveraging in-context learning with a compound model combining GPT-2 and geometry-aware graph neural networks, resulting in significant performance improvements over general graph neural network models on the QM9 dataset.

Large language models manifest the ability of few-shot adaptation to a sequence of provided examples. This behavior, known as in-context learning, allows for performing nontrivial machine learning tasks during inference only. In this work, we address the question: can we leverage in-context learning to predict out-of-distribution materials properties? However, this would not be possible for structure property prediction tasks unless an effective method is found to pass atomic-level geometric features to the transformer model. To address this problem, we employ a compound model in which GPT-2 acts on the output of geometry-aware graph neural networks to adapt in-context information. To demonstrate our model's capabilities, we partition the QM9 dataset into sequences of molecules that share a common substructure and use them for in-context learning. This approach significantly improves the performance of the model on out-of-distribution examples, surpassing the one of general graph neural network models.

View on arXiv PDF

Similar