LG AI CLNov 4, 2024

Fair In-Context Learning via Latent Concept Variables

Karuna Bhaila, Minh-Hao Van, Kennedy Edemacu, Chen Zhao, Feng Chen, Xintao Wu

arXiv:2411.02671v26.45 citationsh-index: 10Has Code

Originality Incremental advance

AI Analysis

This addresses fairness issues in high-stakes applications of LLMs, though it is incremental as it builds on existing demonstration selection approaches.

The paper tackles inherent bias in large language models during in-context learning with tabular data by using latent concept variables for demonstration selection, resulting in improved fairness compared to heuristic methods.

The emerging in-context learning (ICL) ability of large language models (LLMs) has prompted their use for predictive tasks in various domains with different data types, including tabular data, facilitated by serialization methods. However, with increasing applications in high-stakes domains, it has been shown that LLMs can inherit social bias and discrimination from their pre-training data. In this work, we investigate inherent bias in LLMs during in-context learning with tabular data. We focus on an optimal demonstration selection approach that utilizes latent concept variables for resource-efficient task adaptation. We design data augmentation strategies that reduce the correlation between predictive outcomes and sensitive variables, helping promote fairness during latent concept learning. We utilize the learned concept to select demonstrations and obtain fair predictions. The latent concept variables are learned using a smaller internal LLM and generalized to larger external LLMs. We empirically verify that the fair latent variable approach improves fairness results on tabular datasets compared to multiple heuristic demonstration selection methods.

View on arXiv PDF Code

Similar