AIMay 26

Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation

Lulu Zheng, Wenjin Yang, Xiangwen Zhang, Rong Yin, Yulan Hu, Zheng Pan, Xin Li

arXiv:2605.2687881.0

Predicted impact top 34% in AI · last 90 daysOriginality Incremental advance

AI Analysis

For multi-stakeholder alignment tasks, this work identifies and mitigates a previously overlooked source of instability in LLM-based preference aggregation.

The paper shows that holistic LLM judges conflate utility estimation and aggregation, causing unstable weighting noise that grows with stakeholder count. The proposed DecompR method fixes weights before scoring and estimates utilities independently, reducing noise.

Multi-stakeholder tasks require one output to satisfy users with conflicting preferences. Holistic LLM judges conflate utility estimation and utility aggregation, yielding unstable implicit weights. We show empirically and theoretically that this aggregation-specific \emph{weighting noise} can create large score shifts when stakeholder satisfaction is dispersed; in our experiments, these weight-induced shifts also increase with stakeholder count. We propose \textsc{DecompR}: counterfactual-calibrated weights are fixed from query structure before candidate scoring, while per-role utilities are estimated independently, removing candidate-dependent weight drift and reducing estimation noise.

View on arXiv PDF

Similar