CLAIMar 23, 2023

Fairness-guided Few-shot Prompting for Large Language Models

Tencent
arXiv:2303.13217v375 citationsh-index: 80
Originality Incremental advance
AI Analysis

This addresses the problem of unreliable few-shot prompting for users of large language models, offering an incremental improvement in prompt selection.

The paper tackles the instability in few-shot prompting for large language models by linking predictive bias to performance, proposing a greedy search strategy to find near-optimal prompts that enhance in-context learning across tasks with GPT-3.

Large language models have demonstrated surprising ability to perform in-context learning, i.e., these models can be directly applied to solve numerous downstream tasks by conditioning on a prompt constructed by a few input-output examples. However, prior research has shown that in-context learning can suffer from high instability due to variations in training examples, example order, and prompt formats. Therefore, the construction of an appropriate prompt is essential for improving the performance of in-context learning. In this paper, we revisit this problem from the view of predictive bias. Specifically, we introduce a metric to evaluate the predictive bias of a fixed prompt against labels or a given attributes. Then we empirically show that prompts with higher bias always lead to unsatisfactory predictive quality. Based on this observation, we propose a novel search strategy based on the greedy search to identify the near-optimal prompt for improving the performance of in-context learning. We perform comprehensive experiments with state-of-the-art mainstream models such as GPT-3 on various downstream tasks. Our results indicate that our method can enhance the model's in-context learning performance in an effective and interpretable manner.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes