LGITMLFeb 28, 2022

KL Divergence Estimation with Multi-group Attribution

arXiv:2202.13576v11 citations
Originality Incremental advance
AI Analysis

This work addresses fairness in divergence estimation for overlapping groups, but it is incremental as it builds on existing multi-calibration methods.

The paper tackles the problem of estimating KL divergence between distributions while accurately attributing contributions to overlapping sub-populations, motivated by multi-group fairness, and shows that multi-group attribution improves estimates on sub-populations compared to other algorithms.

Estimating the Kullback-Leibler (KL) divergence between two distributions given samples from them is well-studied in machine learning and information theory. Motivated by considerations of multi-group fairness, we seek KL divergence estimates that accurately reflect the contributions of sub-populations to the overall divergence. We model the sub-populations coming from a rich (possibly infinite) family $\mathcal{C}$ of overlapping subsets of the domain. We propose the notion of multi-group attribution for $\mathcal{C}$, which requires that the estimated divergence conditioned on every sub-population in $\mathcal{C}$ satisfies some natural accuracy and fairness desiderata, such as ensuring that sub-populations where the model predicts significant divergence do diverge significantly in the two distributions. Our main technical contribution is to show that multi-group attribution can be derived from the recently introduced notion of multi-calibration for importance weights [HKRR18, GRSW21]. We provide experimental evidence to support our theoretical results, and show that multi-group attribution provides better KL divergence estimates when conditioned on sub-populations than other popular algorithms.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes