AICYFeb 28, 2024

Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information

arXiv:2402.18144v145 citationsh-index: 14
Originality Incremental advance
AI Analysis

This addresses the challenge of emulating human opinions for researchers or pollsters using AI, but it is incremental as it builds on existing ideas about biases in language models.

The study tackled the problem of simulating human sub-population opinions by using large language models based solely on group-level demographic information, finding that models can generate response distributions remarkably similar to actual U.S. public opinion polls, with replicability varying by demographic group and topic due to societal biases.

Large language models exhibit societal biases associated with demographic information, including race, gender, and others. Endowing such language models with personalities based on demographic data can enable generating opinions that align with those of humans. Building on this idea, we propose "random silicon sampling," a method to emulate the opinions of the human population sub-group. Our study analyzed 1) a language model that generates the survey responses that correspond with a human group based solely on its demographic distribution and 2) the applicability of our methodology across various demographic subgroups and thematic questions. Through random silicon sampling and using only group-level demographic information, we discovered that language models can generate response distributions that are remarkably similar to the actual U.S. public opinion polls. Moreover, we found that the replicability of language models varies depending on the demographic group and topic of the question, and this can be attributed to inherent societal biases in the models. Our findings demonstrate the feasibility of mirroring a group's opinion using only demographic distribution and elucidate the effect of social biases in language models on such simulations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes