CLAINov 12, 2024

Fair Summarization: Bridging Quality and Diversity in Extractive Summaries

arXiv:2411.07521v511 citationsh-index: 10Has CodeProceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)
Originality Incremental advance
AI Analysis

This addresses fairness in summarization for NLP applications, but it is incremental as it builds on existing methods with fairness constraints.

The paper tackles the problem of biased summarization in multi-document settings by introducing FairExtract and FairGPT, which achieve superior fairness while maintaining competitive quality on a dataset of tweets from different social groups.

Fairness in multi-document summarization of user-generated content remains a critical challenge in natural language processing (NLP). Existing summarization methods often fail to ensure equitable representation across different social groups, leading to biased outputs. In this paper, we introduce two novel methods for fair extractive summarization: FairExtract, a clustering-based approach, and FairGPT, which leverages GPT-3.5-turbo with fairness constraints. We evaluate these methods using Divsumm summarization dataset of White-aligned, Hispanic, and African-American dialect tweets and compare them against relevant baselines. The results obtained using a comprehensive set of summarization quality metrics such as SUPERT, BLANC, SummaQA, BARTScore, and UniEval, as well as a fairness metric F, demonstrate that FairExtract and FairGPT achieve superior fairness while maintaining competitive summarization quality. Additionally, we introduce composite metrics (e.g., SUPERT+F, BLANC+F) that integrate quality and fairness into a single evaluation framework, offering a more nuanced understanding of the trade-offs between these objectives. Our code is available online.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes