LGJun 10, 2025

Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations

arXiv:2506.09048v16 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses a fundamental gap in understanding task vectors for researchers in in-context learning, though it is incremental in advancing mechanistic insights.

The paper investigates the emergence and functionality of task vectors in in-context learning, proposing the Linear Combination Conjecture that they act as linear combinations of demonstrations, and validates this with theoretical and empirical analyses, including predictions of failure on high-rank mappings.

Task vectors offer a compelling mechanism for accelerating inference in in-context learning (ICL) by distilling task-specific information into a single, reusable representation. Despite their empirical success, the underlying principles governing their emergence and functionality remain unclear. This work proposes the Linear Combination Conjecture, positing that task vectors act as single in-context demonstrations formed through linear combinations of the original ones. We provide both theoretical and empirical support for this conjecture. First, we show that task vectors naturally emerge in linear transformers trained on triplet-formatted prompts through loss landscape analysis. Next, we predict the failure of task vectors on representing high-rank mappings and confirm this on practical LLMs. Our findings are further validated through saliency analyses and parameter visualization, suggesting an enhancement of task vectors by injecting multiple ones into few-shot prompts. Together, our results advance the understanding of task vectors and shed light on the mechanisms underlying ICL in transformer-based models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes