LG AIFeb 1, 2024

LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law

Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls

arXiv:2402.00795v429.537 citationsh-index: 21Has CodeEMNLP

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of understanding LLMs' emergent capabilities in time-series forecasting for researchers in AI and physics, though it is incremental in exploring existing models on new data.

The study investigated whether large language models (LLMs) can extrapolate the behavior of dynamical systems governed by physical principles, finding that LLaMA 2 achieves accurate zero-shot predictions without fine-tuning, with accuracy improving as the input context window lengthens, revealing an in-context neural scaling law.

Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting. However, understanding the mechanisms behind such capabilities remains highly challenging due to the complexity of the models. We study LLMs' ability to extrapolate the behavior of dynamical systems whose evolution is governed by principles of physical interest. Our results show that LLaMA 2, a language model trained primarily on texts, achieves accurate predictions of dynamical system time series without fine-tuning or prompt engineering. Moreover, the accuracy of the learned physical rules increases with the length of the input context window, revealing an in-context version of neural scaling law. Along the way, we present a flexible and efficient algorithm for extracting probability density functions of multi-digit numbers directly from LLMs.

View on arXiv PDF Code

Similar