CLNov 14, 2023

Fair Abstractive Summarization of Diverse Perspectives

arXiv:2311.07884v245 citationsh-index: 24Has Code
Originality Incremental advance
AI Analysis

This addresses the issue of bias in summarization for users and applications relying on social media, reviews, and transcripts, though it is incremental as it builds on existing summarization and fairness work.

The paper tackles the problem of fair abstractive summarization by defining fairness as not underrepresenting diverse perspectives from different social groups, and it finds that both LLM-generated and human-written summaries suffer from low fairness across six datasets.

People from different social and demographic groups express diverse perspectives and conflicting opinions on a broad set of topics such as product reviews, healthcare, law, and politics. A fair summary should provide a comprehensive coverage of diverse perspectives without underrepresenting certain groups. However, current work in summarization metrics and Large Language Models (LLMs) evaluation has not explored fair abstractive summarization. In this paper, we systematically investigate fair abstractive summarization for user-generated data. We first formally define fairness in abstractive summarization as not underrepresenting perspectives of any groups of people, and we propose four reference-free automatic metrics by measuring the differences between target and source perspectives. We evaluate nine LLMs, including three GPT models, four LLaMA models, PaLM 2, and Claude, on six datasets collected from social media, online reviews, and recorded transcripts. Experiments show that both the model-generated and the human-written reference summaries suffer from low fairness. We conduct a comprehensive analysis of the common factors influencing fairness and propose three simple but effective methods to alleviate unfair summarization. Our dataset and code are available at https://github.com/psunlpgroup/FairSumm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes