LLM Voting: Human Choices and AI Collective Decision Making
It addresses the problem of potential biases and reduced diversity when using LLMs in democratic voting processes, which is an incremental study on AI alignment.
This paper investigated how Large Language Models (LLMs) like GPT-4 and LLaMA-2 behave in voting scenarios, finding that their choices are influenced by methods and presentation order, and that varying personas can reduce biases and improve alignment with human patterns, though they may lead to less diverse outcomes.
This paper investigates the voting behaviors of Large Language Models (LLMs), specifically GPT-4 and LLaMA-2, their biases, and how they align with human voting patterns. Our methodology involved using a dataset from a human voting experiment to establish a baseline for human preferences and conducting a corresponding experiment with LLM agents. We observed that the choice of voting methods and the presentation order influenced LLM voting outcomes. We found that varying the persona can reduce some of these biases and enhance alignment with human choices. While the Chain-of-Thought approach did not improve prediction accuracy, it has potential for AI explainability in the voting process. We also identified a trade-off between preference diversity and alignment accuracy in LLMs, influenced by different temperature settings. Our findings indicate that LLMs may lead to less diverse collective outcomes and biased assumptions when used in voting scenarios, emphasizing the need for cautious integration of LLMs into democratic processes.