LGAICLJun 26, 2024

Few-shot Personalization of LLMs with Mis-aligned Responses

arXiv:2406.18678v231 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of providing personalized responses in LLMs for users with diverse backgrounds, though it is incremental as it builds on existing prompt-based personalization approaches.

The paper tackles the problem of personalizing large language models (LLMs) for diverse users by proposing Fermi, a method that learns personalized prompts using user profiles and a few opinion examples, incorporating mis-aligned responses to improve personalization. It shows significant performance improvements over baselines across various benchmarks.

As the diversity of users increases, the capability of providing personalized responses by large language models (LLMs) has become increasingly important. Existing approaches have only limited successes in LLM personalization, due to the absence of personalized learning or the reliance on shared personal data. This paper proposes a new approach for a few-shot personalization of LLMs with their mis-aligned responses (Fermi). Our key idea is to learn a set of personalized prompts for each user by progressively improving the prompts using LLMs, based on user profile (e.g., demographic information) and a few examples of previous opinions. During an iterative process of prompt improvement, we incorporate the contexts of mis-aligned responses by LLMs, which are especially crucial for the effective personalization of LLMs. In addition, we develop an effective inference method to further leverage the context of the test query and the personalized prompts. Our experimental results demonstrate that Fermi significantly improves performance across various benchmarks, compared to best-performing baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes