CLCYJul 18, 2025

PRIDE -- Parameter-Efficient Reduction of Identity Discrimination for Equality in LLMs

arXiv:2507.13743v1h-index: 21Has Code
Originality Synthesis-oriented
AI Analysis

This addresses bias reduction for LGBTQIA+ users in LLMs, but it is incremental as it applies existing PEFT methods to a known issue.

The paper tackled the problem of gender- and sexual-identity biases in LLMs, showing that fine-tuning with LoRA (using <0.1% additional parameters) reduced bias scores by up to 50 points and increased neutrality from near 0% to 36% on the WinoQueer benchmark.

Large Language Models (LLMs) frequently reproduce the gender- and sexual-identity prejudices embedded in their training corpora, leading to outputs that marginalize LGBTQIA+ users. Hence, reducing such biases is of great importance. To achieve this, we evaluate two parameter-efficient fine-tuning (PEFT) techniques - Low-Rank Adaptation (LoRA) and soft-prompt tuning - as lightweight alternatives to full-model fine-tuning for mitigating such biases. Using the WinoQueer benchmark, we quantify bias in three open-source LLMs and observe baseline bias scores reaching up to 98 (out of 100) across a range of queer identities defined by gender and/or sexual orientation, where 50 would indicate neutrality. Fine-tuning with LoRA (< 0.1% additional parameters) on a curated QueerNews corpus reduces those scores by up to 50 points and raises neutrality from virtually 0% to as much as 36%. Soft-prompt tuning (10 virtual tokens) delivers only marginal improvements. These findings show that LoRA can deliver meaningful fairness gains with minimal computation. We advocate broader adoption of community-informed PEFT, the creation of larger queer-authored corpora, and richer evaluation suites beyond WinoQueer, coupled with ongoing audits to keep LLMs inclusive.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes