AICLJan 29

System 1&2 Synergy via Dynamic Model Interpolation

arXiv:2601.21414v14 citationsh-index: 9
Originality Incremental advance
AI Analysis

This work addresses the problem of cognitive mode interference in language models for researchers and practitioners, offering a novel approach to combine efficiency and reasoning depth, though it is incremental in building on existing checkpoints.

The paper tackles the challenge of training a unified language model that adapts between intuitive System 1 and deliberative System 2 modes by shifting focus from output control to capability control, using dynamic parameter interpolation without additional training. The proposed DAMI framework achieves higher accuracy than the Thinking model while maintaining efficiency on five mathematical reasoning benchmarks.

Training a unified language model that adapts between intuitive System 1 and deliberative System 2 remains challenging due to interference between their cognitive modes. Recent studies have thus pursued making System 2 models more efficient. However, these approaches focused on output control, limiting what models produce. We argue that this paradigm is misaligned: output length is merely a symptom of the model's cognitive configuration, not the root cause. In this work, we shift the focus to capability control, which modulates \textit{how models think} rather than \textit{what they produce}. To realize this, we leverage existing Instruct and Thinking checkpoints through dynamic parameter interpolation, without additional training. Our pilot study establishes that linear interpolation yields a convex, monotonic Pareto frontier, underpinned by representation continuity and structural connectivity. Building on this, we propose \textbf{DAMI} (\textbf{D}yn\textbf{A}mic \textbf{M}odel \textbf{I}nterpolation), a framework that estimates a query-specific Reasoning Intensity $λ(q)$ to configure cognitive depth. For training-based estimation, we develop a preference learning method encoding accuracy and efficiency criteria. For zero-shot deployment, we introduce a confidence-based method leveraging inter-model cognitive discrepancy. Experiments on five mathematical reasoning benchmarks demonstrate that DAMI achieves higher accuracy than the Thinking model while remaining efficient, effectively combining the efficiency of System 1 with the reasoning depth of System 2.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes