CLJan 12, 2024

Misconfidence-based Demonstration Selection for LLM In-Context Learning

arXiv:2401.06301v110.427 citationsh-index: 6

Originality Incremental advance

AI Analysis

This addresses a key bottleneck in making in-context learning more practical and cost-effective for users of LLMs, though it is incremental over existing methods.

The paper tackles the problem of selecting demonstrations for in-context learning with large language models, proposing In-Context Reflection (ICR) to reduce output discrepancies, which achieves an average 4% performance boost across five datasets.

In-context learning with large language models (LLMs) excels at adapting to various tasks rapidly. However, its success hinges on carefully selecting demonstrations, which remains an obstacle in practice. Current approaches to this problem either rely on hard-to-acquire external supervision or require frequent interactions with LLMs, resulting in high costs. We propose a new method called In-Context Reflection (ICR) to overcome these challenges. ICR strategically selects demonstrations to reduce the discrepancy between the LLM's outputs and the actual input-output mappings. Specifically, ICR starts with a random set of initial demonstrations, then iteratively refines it. In each step, it analyzes a pool of candidate examples and identifies the ones most likely to challenge the LLM's current understanding, measured by a new metric called misconfidence. These most confusing examples are then selected to replace the less informative demonstrations in the current set. Our comprehensive evaluation across five diverse datasets encompassing 13 subtasks shows the efficacy of ICR. Compared to existing methods, ICR achieves an average performance boost of 4%, while demonstrating remarkable cross-task generalization capabilities.

View on arXiv PDF

Similar