MLLGFeb 12

PAC-Bayesian Generalization Guarantees for Fairness on Stochastic and Deterministic Classifiers

arXiv:2602.11722v1h-index: 28
Originality Incremental advance
AI Analysis

This work addresses the need for rigorous fairness guarantees in classification models, offering a novel theoretical approach that is incremental in extending PAC-Bayes to fairness constraints.

The paper tackles the problem of providing theoretical generalization guarantees for fairness in machine learning by proposing a PAC-Bayesian framework that covers both stochastic and deterministic classifiers, resulting in bounds that are empirically shown to be tight across three fairness measures.

Classical PAC generalization bounds on the prediction risk of a classifier are insufficient to provide theoretical guarantees on fairness when the goal is to learn models balancing predictive risk and fairness constraints. We propose a PAC-Bayesian framework for deriving generalization bounds for fairness, covering both stochastic and deterministic classifiers. For stochastic classifiers, we derive a fairness bound using standard PAC-Bayes techniques. Whereas for deterministic classifiers, as usual PAC-Bayes arguments do not apply directly, we leverage a recent advance in PAC-Bayes to extend the fairness bound beyond the stochastic setting. Our framework has two advantages: (i) It applies to a broad class of fairness measures that can be expressed as a risk discrepancy, and (ii) it leads to a self-bounding algorithm in which the learning procedure directly optimizes a trade-off between generalization bounds on the prediction risk and on the fairness. We empirically evaluate our framework with three classical fairness measures, demonstrating not only its usefulness but also the tightness of our bounds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes