CLJan 8, 2025

Small Changes, Large Consequences: Analyzing the Allocational Fairness of LLMs in Hiring Contexts

arXiv:2501.04316v25 citationsh-index: 10IJCNLP-AACL
Originality Incremental advance
AI Analysis

This work addresses fairness issues in LLM deployments for hiring, highlighting potential biases in high-stakes applications, though it is incremental as it builds on existing fairness research.

The study investigated the fairness of LLM-based hiring systems in resume summarization and applicant ranking, finding that models exhibited meaningful differences across demographic groups, with retrieval models showing high sensitivity to race and gender perturbations that could lead to discriminatory outcomes.

Large language models (LLMs) are increasingly being deployed in high-stakes applications like hiring, yet their potential for unfair decision-making remains understudied in generative and retrieval settings. In this work, we examine the allocational fairness of LLM-based hiring systems through two tasks that reflect actual HR usage: resume summarization and applicant ranking. By constructing a synthetic resume dataset with controlled perturbations and curating job postings, we investigate whether model behavior differs across demographic groups. Our findings reveal that generated summaries exhibit meaningful differences more frequently for race than for gender perturbations. Models also display non-uniform retrieval selection patterns across demographic groups and exhibit high ranking sensitivity to both gender and race perturbations. Surprisingly, retrieval models can show comparable sensitivity to both demographic and non-demographic changes, suggesting that fairness issues may stem from broader model brittleness. Overall, our results indicate that LLM-based hiring systems, especially in the retrieval stage, can exhibit notable biases that lead to discriminatory outcomes in real-world contexts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes