LGCLCVOct 24, 2025

FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models

arXiv:2510.21363v13 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

It addresses fairness issues in AI-generated images for users and developers, offering an incremental improvement as a post-hoc method without retraining.

The paper tackles bias in text-to-image models like Stable Diffusion by introducing FairImagen, a post-processing framework that reduces demographic biases in generated images with a moderate trade-off in quality and fidelity.

Text-to-image diffusion models, such as Stable Diffusion, have demonstrated remarkable capabilities in generating high-quality and diverse images from natural language prompts. However, recent studies reveal that these models often replicate and amplify societal biases, particularly along demographic attributes like gender and race. In this paper, we introduce FairImagen (https://github.com/fuzihaofzh/FairImagen), a post-hoc debiasing framework that operates on prompt embeddings to mitigate such biases without retraining or modifying the underlying diffusion model. Our method integrates Fair Principal Component Analysis to project CLIP-based input embeddings into a subspace that minimizes group-specific information while preserving semantic content. We further enhance debiasing effectiveness through empirical noise injection and propose a unified cross-demographic projection method that enables simultaneous debiasing across multiple demographic attributes. Extensive experiments across gender, race, and intersectional settings demonstrate that FairImagen significantly improves fairness with a moderate trade-off in image quality and prompt fidelity. Our framework outperforms existing post-hoc methods and offers a simple, scalable, and model-agnostic solution for equitable text-to-image generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes