IR LGMay 30, 2022

Uncertainty Quantification for Fairness in Two-Stage Recommender Systems

arXiv:2205.15436v314.227 citationsh-index: 74Has Code

Originality Highly original

AI Analysis

This addresses fairness issues for items in large-scale recommender systems, offering a novel solution to prevent irrecoverable unfairness in the first stage, though it is incremental in applying uncertainty quantification to this specific bottleneck.

The paper tackles the problem of ensuring group fairness in two-stage recommender systems, where first-stage recommenders may select unfair candidate sets, by proposing two threshold-policy selection rules that provide distribution-free and finite-sample fairness guarantees, resulting in near-optimal candidate sets with enough relevant items from each group while minimizing size.

Many large-scale recommender systems consist of two stages. The first stage efficiently screens the complete pool of items for a small subset of promising candidates, from which the second-stage model curates the final recommendations. In this paper, we investigate how to ensure group fairness to the items in this two-stage architecture. In particular, we find that existing first-stage recommenders might select an irrecoverably unfair set of candidates such that there is no hope for the second-stage recommender to deliver fair recommendations. To this end, motivated by recent advances in uncertainty quantification, we propose two threshold-policy selection rules that can provide distribution-free and finite-sample guarantees on fairness in first-stage recommenders. More concretely, given any relevance model of queries and items and a point-wise lower confidence bound on the expected number of relevant items for each threshold-policy, the two rules find near-optimal sets of candidates that contain enough relevant items in expectation from each group of items. To instantiate the rules, we demonstrate how to derive such confidence bounds from potentially partial and biased user feedback data, which are abundant in many large-scale recommender systems. In addition, we provide both finite-sample and asymptotic analyses of how close the two threshold selection rules are to the optimal thresholds. Beyond this theoretical analysis, we show empirically that these two rules can consistently select enough relevant items from each group while minimizing the size of the candidate sets for a wide range of settings.

View on arXiv PDF Code

Similar