AIMay 22, 2025

MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning

arXiv:2505.16225v212 citationsh-index: 18ICML
Originality Incremental advance
AI Analysis

This addresses the problem of expensive data labeling for researchers and practitioners using many-shot in-context learning, representing an incremental improvement.

The paper tackles the high cost of labeled data in many-shot in-context learning for large language models by proposing MAPLE, a framework that uses pseudo-labeled samples to improve performance without significant labeling costs, demonstrating effectiveness on real-world datasets.

In-Context Learning (ICL) empowers Large Language Models (LLMs) to tackle diverse tasks by incorporating multiple input-output examples, known as demonstrations, into the input of LLMs. More recently, advancements in the expanded context windows of LLMs have led to many-shot ICL, which uses hundreds of demonstrations and outperforms few-shot ICL, which relies on fewer examples. However, this approach is often hindered by the high cost of obtaining large amounts of labeled data. To address this challenge, we propose Many-Shot Adaptive Pseudo-LabEling, namely MAPLE, a novel influence-based many-shot ICL framework that utilizes pseudo-labeled samples to compensate for the lack of label information. We first identify a subset of impactful unlabeled samples and perform pseudo-labeling on them by querying LLMs. These pseudo-labeled samples are then adaptively selected and tailored to each test query as input to improve the performance of many-shot ICL, without significant labeling costs. Extensive experiments on real-world datasets demonstrate the effectiveness of our framework, showcasing its ability to enhance LLM adaptability and performance with limited labeled data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes