CYAICLFeb 2, 2024

The Political Preferences of LLMs

arXiv:2402.01789v2108 citationsh-index: 2Has CodePLoS ONE
Originality Incremental advance
AI Analysis

This addresses potential societal impacts of political biases in LLMs as they replace traditional information sources, though it is incremental in methodology.

The study analyzed political preferences in 24 conversational LLMs, finding most exhibit left-of-center biases, and showed that supervised fine-tuning can steer these biases with modest data.

I report here a comprehensive analysis about the political preferences embedded in Large Language Models (LLMs). Namely, I administer 11 political orientation tests, designed to identify the political preferences of the test taker, to 24 state-of-the-art conversational LLMs, both closed and open source. When probed with questions/statements with political connotations, most conversational LLMs tend to generate responses that are diagnosed by most political test instruments as manifesting preferences for left-of-center viewpoints. This does not appear to be the case for five additional base (i.e. foundation) models upon which LLMs optimized for conversation with humans are built. However, the weak performance of the base models at coherently answering the tests' questions makes this subset of results inconclusive. Finally, I demonstrate that LLMs can be steered towards specific locations in the political spectrum through Supervised Fine-Tuning (SFT) with only modest amounts of politically aligned data, suggesting SFT's potential to embed political orientation in LLMs. With LLMs beginning to partially displace traditional information sources like search engines and Wikipedia, the societal implications of political biases embedded in LLMs are substantial.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes