CLOct 25, 2025

SteerX: Disentangled Steering for LLM Personalization

arXiv:2510.22256v17 citationsh-index: 18
Originality Incremental advance
AI Analysis

This work provides a more effective solution for personalizing LLMs in applications like intelligent assistants, though it is incremental as it builds on existing steering methods.

The paper tackled the problem of personalizing large language models (LLMs) by addressing noise in activation steering from irrelevant historical data, resulting in improved steering vector quality and enhanced personalization as demonstrated in experiments on real-world datasets.

Large language models (LLMs) have shown remarkable success in recent years, enabling a wide range of applications, including intelligent assistants that support users' daily life and work. A critical factor in building such assistants is personalizing LLMs, as user preferences and needs vary widely. Activation steering, which directly leverages directions representing user preference in the LLM activation space to adjust its behavior, offers a cost-effective way to align the model's outputs with individual users. However, existing methods rely on all historical data to compute the steering vector, ignoring that not all content reflects true user preferences, which undermines the personalization signal. To address this, we propose SteerX, a disentangled steering method that isolates preference-driven components from preference-agnostic components. Grounded in causal inference theory, SteerX estimates token-level causal effects to identify preference-driven tokens, transforms these discrete signals into a coherent description, and then leverages them to steer personalized LLM generation. By focusing on the truly preference-driven information, SteerX produces more accurate activation steering vectors and enhances personalization. Experiments on two representative steering backbone methods across real-world datasets demonstrate that SteerX consistently enhances steering vector quality, offering a practical solution for more effective LLM personalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes