CLAILGMar 6, 2025

Enough Coin Flips Can Make LLMs Act Bayesian

arXiv:2503.04722v218 citationsh-index: 8ACL
Originality Incremental advance
AI Analysis

This addresses the problem of understanding LLM reasoning mechanisms for researchers, showing incremental insights into their Bayesian consistency.

The study investigated whether large language models (LLMs) use in-context learning for Bayesian reasoning with biased coin flips, finding that with enough demonstrations, LLMs update priors in a Bayesian manner, though initial biases and miscalibrations cause deviations.

Large language models (LLMs) exhibit the ability to generalize given few-shot examples in their input prompt, an emergent capability known as in-context learning (ICL). We investigate whether LLMs use ICL to perform structured reasoning in ways that are consistent with a Bayesian framework or rely on pattern matching. Using a controlled setting of biased coin flips, we find that: (1) LLMs often possess biased priors, causing initial divergence in zero-shot settings, (2) in-context evidence outweighs explicit bias instructions, (3) LLMs broadly follow Bayesian posterior updates, with deviations primarily due to miscalibrated priors rather than flawed updates, and (4) attention magnitude has negligible effect on Bayesian inference. With sufficient demonstrations of biased coin flips via ICL, LLMs update their priors in a Bayesian manner.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes