CLAIApr 20

Latent Preference Modeling for Cross-Session Personalized Tool Calling

arXiv:2604.1788681.61 citationsh-index: 1
Predicted impact top 65% in CL · last 90 daysOriginality Incremental advance
AI Analysis

For developers of LLM-based agents, this work addresses the practical problem of under-specified user requests by enabling efficient personalization through memory of user preferences.

The paper introduces MPT, a benchmark for personalized tool calling in multi-session dialogues, and proposes PRefine, a memory-augmented method that improves tool-calling accuracy while using only 1.24% of the tokens required by full-history prompting.

Users often omit essential details in their requests to LLM-based agents, resulting in under-specified inputs for tool use. This poses a fundamental challenge for tool-augmented agents, as API execution typically requires complete arguments, highlighting the need for personalized tool calling. To study this problem, we introduce MPT, a benchmark comprising 265 multi-session dialogues that cover three challenges: Preference Recall, Preference Induction, and Preference Transfer. We also propose PRefine, a test-time memory-augmented method that represents user preferences as evolving hypotheses. Through a generate--verify--refine loop, it extracts reusable constraints from history and improves tool-calling accuracy while using only 1.24% of the tokens required by full-history prompting. These results indicate that robust personalization in agentic systems depends on memory that captures the reasons behind user choices, not just the choices themselves.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes