CLAIOct 25, 2023

Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting

arXiv:2310.16523v1150 citationsh-index: 45
Originality Incremental advance
AI Analysis

This addresses the issue of homogenization and erasure of demographic groups in AI-generated content, though it is incremental as it builds on existing prompting methods.

The paper tackled the problem of demographic under-representation in LLM responses by formalizing diversity metrics and proposing a prompting technique called collective-critique and self-voting (CCSV), which significantly improved diversity over baselines in experiments.

A crucial challenge for generative large language models (LLMs) is diversity: when a user's prompt is under-specified, models may follow implicit assumptions while generating a response, which may result in homogenization of the responses, as well as certain demographic groups being under-represented or even erased from the generated responses. In this paper, we formalize diversity of representation in generative LLMs. We present evaluation datasets and propose metrics to measure diversity in generated responses along people and culture axes. We find that LLMs understand the notion of diversity, and that they can reason and critique their own responses for that goal. This finding motivated a new prompting technique called collective-critique and self-voting (CCSV) to self-improve people diversity of LLMs by tapping into its diversity reasoning capabilities, without relying on handcrafted examples or prompt tuning. Extensive empirical experiments with both human and automated evaluations show that our proposed approach is effective at improving people and culture diversity, and outperforms all baseline methods by a large margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes