CLMar 11, 2025

Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations

arXiv:2503.08035v11 citationsh-index: 14
AI Analysis

This addresses personalization for distinct user groups in LLMs, though it is incremental as it builds on existing personalization and alignment techniques.

The paper tackles the problem of LLMs failing to meet specialized user group needs by proposing Group Preference Alignment (GPA), a framework that extracts group-specific preferences from conversations and steers LLMs to address them, resulting in significantly improved alignment with user preferences and outperforming baseline methods.

LLMs often fail to meet the specialized needs of distinct user groups due to their one-size-fits-all training paradigm \cite{lucy-etal-2024-one} and there is limited research on what personalization aspects each group expect. To address these limitations, we propose a group-aware personalization framework, Group Preference Alignment (GPA), that identifies context-specific variations in conversational preferences across user groups and then steers LLMs to address those preferences. Our approach consists of two steps: (1) Group-Aware Preference Extraction, where maximally divergent user-group preferences are extracted from real-world conversation logs and distilled into interpretable rubrics, and (2) Tailored Response Generation, which leverages these rubrics through two methods: a) Context-Tuned Inference (GAP-CT), that dynamically adjusts responses via context-dependent prompt instructions, and b) Rubric-Finetuning Inference (GPA-FT), which uses the rubrics to generate contrastive synthetic data for personalization of group-specific models via alignment. Experiments demonstrate that our framework significantly improves alignment of the output with respect to user preferences and outperforms baseline methods, while maintaining robust performance on standard benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes