LGAICLJan 7, 2025

More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

arXiv:2501.04070v310 citationsh-index: 12Has CodeACL
Originality Incremental advance
AI Analysis

This addresses a key bottleneck in scaling in-context learning for AI practitioners, though it is incremental as it builds on existing ICL methods.

The paper tackles the performance plateau and decline in large language models when in-context learning demonstrations increase from few to many, introducing DrICL with differentiated and reweighting objectives to enhance performance. Results show significant improvements across various tasks, with a new benchmark (ICL-50) covering 50 tasks and up to 350 shots.

Large language models (LLMs) excel at few-shot in-context learning (ICL) without requiring parameter updates. However, as ICL demonstrations increase from a few to many, performance tends to plateau and eventually decline. We identify two primary causes for this trend: the suboptimal negative log-likelihood (NLL) optimization objective and the incremental data noise. To address these issues, we introduce \textit{DrICL}, a novel optimization method that enhances model performance through \textit{Differentiated} and \textit{Reweighting} objectives. Globally, DrICL utilizes differentiated learning to optimize the NLL objective, ensuring that many-shot performance surpasses zero-shot levels. Locally, it dynamically adjusts the weighting of many-shot demonstrations by leveraging cumulative advantages inspired by reinforcement learning, thereby mitigating the impact of noisy data. Recognizing the lack of multi-task datasets with diverse many-shot distributions, we develop the \textit{Many-Shot ICL Benchmark} (ICL-50)-a large-scale benchmark of 50 tasks that cover shot numbers from 1 to 350 within sequences of up to 8,000 tokens-for both fine-tuning and evaluation purposes. Experimental results demonstrate that LLMs enhanced with DrICL achieve significant improvements in many-shot setups across various tasks, including both in-domain and out-of-domain scenarios. We release the code and dataset hoping to facilitate further research in many-shot ICL\footnote{https://github.com/xiaoqzhwhu/DrICL}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes