DMWM: Dual-Mind World Model with Long-Term Imagination
This work addresses the problem of long-term planning in sample-efficient learning for agents, which is crucial for tasks that require reasoning and imagination over extended periods.
The authors tackled the problem of long-term imagination in world models, achieving significant improvements in logical coherence, trial efficiency, data efficiency, and long-term imagination with their proposed DMWM framework. The framework outperformed state-of-the-art world models on benchmark tasks from the DMControl suite.
Imagination in world models is crucial for enabling agents to learn long-horizon policy in a sample-efficient manner. Existing recurrent state-space model (RSSM)-based world models depend on single-step statistical inference to capture the environment dynamics, and, hence, they are unable to perform long-term imagination tasks due to the accumulation of prediction errors. Inspired by the dual-process theory of human cognition, we propose a novel dual-mind world model (DMWM) framework that integrates logical reasoning to enable imagination with logical consistency. DMWM is composed of two components: an RSSM-based System 1 (RSSM-S1) component that handles state transitions in an intuitive manner and a logic-integrated neural network-based System 2 (LINN-S2) component that guides the imagination process through hierarchical deep logical reasoning. The inter-system feedback mechanism is designed to ensure that the imagination process follows the logical rules of the real environment. The proposed framework is evaluated on benchmark tasks that require long-term planning from the DMControl suite. Extensive experimental results demonstrate that the proposed framework yields significant improvements in terms of logical coherence, trial efficiency, data efficiency and long-term imagination over the state-of-the-art world models.