IRMay 12

TwiSTAR:Think Fast, Think Slow, Then Act,Generative Recommendation with Adaptive Reasoning

Shiteng Cao, Kaian Jiang, Yunlong Gong, Zhiheng Li

arXiv:2605.1155311.11 citations

Predicted impact top 41% in IR · last 90 daysOriginality Incremental advance

AI Analysis

For generative recommendation systems, this work addresses the trade-off between accuracy and latency by adaptively choosing between fast and slow reasoning.

TwiSTAR adaptively allocates reasoning effort per user sequence in generative recommendation, achieving consistent accuracy gains while reducing inference latency compared to uniform slow reasoning across three datasets.

Generative recommendation with Semantic IDs (SIDs) has emerged as a promising paradigm, yet existing methods apply a fixed inference strategy, either fast direct generation or slow chain-of-thought reasoning, uniformly across all user histories. This approach creates a trade-off: fast recommendation model produces suboptimal accuracy on hard samples, while always invoking slow reasoning incurs prohibitive latency and wastes computation on easy cases. To address this, we propose Think Fast, Think Slow, Then Act, a framework that learns to adaptively allocate reasoning effort per user sequence. Our system equips an LLM with three complementary tools: a fast SID-based retriever, a lightweight candidate ranker, and a slow reasoning model that generates explicit rationales before recommending. Crucially, we inject collaborative commonsense into the slow model by transforming item-to-item knowledge into natural language explanations. A planner, trained through supervised warm-up followed by agentic reinforcement learning, dynamically decides which tool to invoke. Experiments on three datasets demonstrate that our method outperforms strong baselines, achieving consistent accuracy gains while reducing inference latency compared to uniform slow reasoning.

View on arXiv PDF

Similar