CLApr 9, 2025

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models

Zhouhang Xie, Junda Wu, Yiran Shen, Yu Xia, Xintong Li, Aaron Chang, Ryan Rossi, Sachin Kumar, Bodhisattwa Prasad Majumder, Jingbo Shang, Prithviraj Ammanabrolu, Julian McAuley

arXiv:2504.07070v123.421 citationsh-index: 22

Originality Synthesis-oriented

AI Analysis

This is an incremental survey paper that organizes existing research for researchers in NLP and personalization.

This survey analyzes personalized preference alignment techniques for large language models, categorizing them into training time, inference time, and user-modeling methods, while discussing their strengths, limitations, evaluation, benchmarks, and open problems.

Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present an analysis of works on personalized alignment and modeling for LLMs. We introduce a taxonomy of preference alignment techniques, including training time, inference time, and additionally, user-modeling based methods. We provide analysis and discussion on the strengths and limitations of each group of techniques and then cover evaluation, benchmarks, as well as open problems in the field.

View on arXiv PDF

Similar