Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 88% Match Research Paper Reinforcement learning researchers,AI researchers,Robotics engineers,Cognitive scientists 2 weeks ago

DMWM: Dual-Mind World Model with Long-Term Imagination

reinforcement-learning › multi-agent
📄 Abstract

Abstract: Imagination in world models is crucial for enabling agents to learn long-horizon policy in a sample-efficient manner. Existing recurrent state-space model (RSSM)-based world models depend on single-step statistical inference to capture the environment dynamics, and, hence, they are unable to perform long-term imagination tasks due to the accumulation of prediction errors. Inspired by the dual-process theory of human cognition, we propose a novel dual-mind world model (DMWM) framework that integrates logical reasoning to enable imagination with logical consistency. DMWM is composed of two components: an RSSM-based System 1 (RSSM-S1) component that handles state transitions in an intuitive manner and a logic-integrated neural network-based System 2 (LINN-S2) component that guides the imagination process through hierarchical deep logical reasoning. The inter-system feedback mechanism is designed to ensure that the imagination process follows the logical rules of the real environment. The proposed framework is evaluated on benchmark tasks that require long-term planning from the DMControl suite. Extensive experimental results demonstrate that the proposed framework yields significant improvements in terms of logical coherence, trial efficiency, data efficiency and long-term imagination over the state-of-the-art world models.
Authors (4)
Lingyi Wang
Rashed Shelim
Walid Saad
Naren Ramakrishnan
Submitted
February 11, 2025
arXiv Category
cs.LG
arXiv PDF

Key Contributions

Proposes the Dual-Mind World Model (DMWM) framework, inspired by dual-process theory, which integrates an RSSM-based System 1 for intuitive transitions and a logic-integrated neural network System 2 for guided, logical imagination. This allows for long-term imagination with logical consistency, improving sample efficiency for learning long-horizon policies.

Business Value

Enables AI agents to learn more complex tasks and make better long-term decisions in dynamic environments, leading to more capable autonomous systems in robotics, logistics, and beyond.