AILGFeb 3

Mitigating Conversational Inertia in Multi-Turn Agents

arXiv:2602.03664v13 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in deploying LLMs as agents, offering an incremental improvement for multi-turn interaction scenarios.

The paper tackles the problem of conversational inertia in multi-turn agents, where LLMs mimic their own previous responses, and proposes Context Preference Learning to reduce this bias, achieving performance improvements across eight agentic environments.

Large language models excel as few-shot learners when provided with appropriate demonstrations, yet this strength becomes problematic in multiturn agent scenarios, where LLMs erroneously mimic their own previous responses as few-shot examples. Through attention analysis, we identify conversational inertia, a phenomenon where models exhibit strong diagonal attention to previous responses, which is associated with imitation bias that constrains exploration. This reveals a tension when transforming few-shot LLMs into agents: longer context enriches environmental feedback for exploitation, yet also amplifies conversational inertia that undermines exploration. Our key insight is that for identical states, actions generated with longer contexts exhibit stronger inertia than those with shorter contexts, enabling construction of preference pairs without environment rewards. Based on this, we propose Context Preference Learning to calibrate model preferences to favor low-inertia responses over highinertia ones. We further provide context management strategies at inference time to balance exploration and exploitation. Experimental results across eight agentic environments and one deep research scenario validate that our framework reduces conversational inertia and achieves performance improvements.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes