CLIRApr 9, 2024

Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation

arXiv:2404.05970v1132 citationsh-index: 12SIGIR
Originality Incremental advance
AI Analysis

This work addresses the challenge of personalizing LLMs for various applications, representing an incremental advancement in retrieval-augmented methods.

The paper tackles the problem of optimizing retrieval models for personalizing large language models (LLMs) by proposing two algorithms that use feedback from downstream generation tasks, achieving statistically significant improvements in six out of seven datasets on the LaMP benchmark.

This paper studies retrieval-augmented approaches for personalizing large language models (LLMs), which potentially have a substantial impact on various applications and domains. We propose the first attempt to optimize the retrieval models that deliver a limited number of personal documents to large language models for the purpose of personalized generation. We develop two optimization algorithms that solicit feedback from the downstream personalized generation tasks for retrieval optimization -- one based on reinforcement learning whose reward function is defined using any arbitrary metric for personalized generation and another based on knowledge distillation from the downstream LLM to the retrieval model. This paper also introduces a pre- and post-generation retriever selection model that decides what retriever to choose for each LLM input. Extensive experiments on diverse tasks from the language model personalization (LaMP) benchmark reveal statistically significant improvements in six out of seven datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes