LGAIFeb 1, 2024

LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law

arXiv:2402.00795v437 citationsh-index: 23EMNLP
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of understanding LLMs' emergent capabilities in time-series forecasting for researchers in AI and physics, though it is incremental in exploring existing models on new data.

The study investigated whether large language models (LLMs) can extrapolate the behavior of dynamical systems governed by physical principles, finding that LLaMA 2 achieves accurate zero-shot predictions without fine-tuning, with accuracy improving as the input context window lengthens, revealing an in-context neural scaling law.

Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting. However, understanding the mechanisms behind such capabilities remains highly challenging due to the complexity of the models. We study LLMs' ability to extrapolate the behavior of dynamical systems whose evolution is governed by principles of physical interest. Our results show that LLaMA 2, a language model trained primarily on texts, achieves accurate predictions of dynamical system time series without fine-tuning or prompt engineering. Moreover, the accuracy of the learned physical rules increases with the length of the input context window, revealing an in-context version of neural scaling law. Along the way, we present a flexible and efficient algorithm for extracting probability density functions of multi-digit numbers directly from LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes