CLAILGDec 19, 2023

Active Preference Inference using Language Models and Probabilistic Reasoning

arXiv:2312.12009v231 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the need for more efficient personalization in human-facing AI systems, though it is incremental as it builds on existing LLM capabilities with a specific algorithmic enhancement.

The paper tackled the problem of inefficient preference inference by large language models (LLMs) in interactive systems, introducing an algorithm that uses probabilistic reasoning to generate more informative questions, resulting in improved task performance with fewer user interactions in a web shopping setting.

Actively inferring user preferences, for example by asking good questions, is important for any human-facing decision-making system. Active inference allows such systems to adapt and personalize themselves to nuanced individual preferences. To enable this ability for instruction-tuned large language models (LLMs), one may prompt them to ask users questions to infer their preferences, transforming the language models into more robust, interactive systems. However, out of the box, these models are not efficient at extracting preferences: the questions they generate are not informative, requiring a high number of user interactions and impeding the usability of the downstream system. In this work, we introduce an inference-time algorithm that helps LLMs quickly infer preferences by using more informative questions. Our algorithm uses a probabilistic model whose conditional distributions are defined by prompting an LLM, and returns questions that optimize expected entropy and expected model change. Results in a simplified interactive web shopping setting with real product items show that an LLM equipped with our entropy reduction algorithm outperforms baselines with the same underlying LLM on task performance while using fewer user interactions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes