CLAIJul 25, 2024

Examining the Influence of Political Bias on Large Language Model Performance in Stance Classification

arXiv:2407.17688v215 citationsh-index: 16
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of bias in AI for researchers and practitioners, but it is incremental as it extends existing bias analysis to political contexts without introducing new methods.

The study investigated whether political biases in large language models affect their performance in stance classification tasks, finding statistically significant differences in accuracy across politically oriented datasets, with poorer performance when targets are ambiguous.

Large Language Models (LLMs) have demonstrated remarkable capabilities in executing tasks based on natural language queries. However, these models, trained on curated datasets, inherently embody biases ranging from racial to national and gender biases. It remains uncertain whether these biases impact the performance of LLMs for certain tasks. In this study, we investigate the political biases of LLMs within the stance classification task, specifically examining whether these models exhibit a tendency to more accurately classify politically-charged stances. Utilizing three datasets, seven LLMs, and four distinct prompting schemes, we analyze the performance of LLMs on politically oriented statements and targets. Our findings reveal a statistically significant difference in the performance of LLMs across various politically oriented stance classification tasks. Furthermore, we observe that this difference primarily manifests at the dataset level, with models and prompting schemes showing statistically similar performances across different stance classification datasets. Lastly, we observe that when there is greater ambiguity in the target the statement is directed towards, LLMs have poorer stance classification accuracy. Code & Dataset: http://doi.org/10.5281/zenodo.12938478

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes