CLAILGMay 22, 2025

From Compression to Expression: A Layerwise Analysis of In-Context Learning

arXiv:2505.17322v25 citationsh-index: 5
Originality Incremental advance
AI Analysis

This provides insights into how in-context learning works internally, which is significant for researchers and practitioners aiming to optimize and interpret large language models, though it is incremental as it builds on existing ICL studies.

The paper tackles the problem of understanding the internal representational mechanisms of in-context learning in large language models, revealing a layerwise compression-expression phenomenon where early layers compress task information and later layers express it, which consistently improves performance with model size and demonstration count.

In-context learning (ICL) enables large language models (LLMs) to adapt to new tasks without weight updates by learning from demonstration sequences. While ICL shows strong empirical performance, its internal representational mechanisms are not yet well understood. In this work, we conduct a statistical geometric analysis of ICL representations to investigate how task-specific information is captured across layers. Our analysis reveals an intriguing phenomenon, which we term *Layerwise Compression-Expression*: early layers progressively produce compact and discriminative representations that encode task information from the input demonstrations, while later layers express these representations to incorporate the query and generate the prediction. This phenomenon is observed consistently across diverse tasks and a range of contemporary LLM architectures. We demonstrate that it has important implications for ICL performance -- improving with model size and the number of demonstrations -- and for robustness in the presence of noisy examples. To further understand the effect of the compact task representation, we propose a bias-variance decomposition and provide a theoretical analysis showing how attention mechanisms contribute to reducing both variance and bias, thereby enhancing performance as the number of demonstrations increases. Our findings reveal an intriguing layerwise dynamic in ICL, highlight how structured representations emerge within LLMs, and showcase that analyzing internal representations can facilitate a deeper understanding of model behavior.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes