arxiv_ai 88% Match Research Paper Reinforcement learning researchers,AI researchers,Robotics engineers,Cognitive scientists 2 weeks ago

DMWM: Dual-Mind World Model with Long-Term Imagination

reinforcement-learning › multi-agent

📄 Abstract

Abstract: Imagination in world models is crucial for enabling agents to learn long-horizon policy in a sample-efficient manner. Existing recurrent state-space model (RSSM)-based world models depend on single-step statistical inference to capture the environment dynamics, and, hence, they are unable to perform long-term imagination tasks due to the accumulation of prediction errors. Inspired by the dual-process theory of human cognition, we propose a novel dual-mind world model (DMWM) framework that integrates logical reasoning to enable imagination with logical consistency. DMWM is composed of two components: an RSSM-based System 1 (RSSM-S1) component that handles state transitions in an intuitive manner and a logic-integrated neural network-based System 2 (LINN-S2) component that guides the imagination process through hierarchical deep logical reasoning. The inter-system feedback mechanism is designed to ensure that the imagination process follows the logical rules of the real environment. The proposed framework is evaluated on benchmark tasks that require long-term planning from the DMControl suite. Extensive experimental results demonstrate that the proposed framework yields significant improvements in terms of logical coherence, trial efficiency, data efficiency and long-term imagination over the state-of-the-art world models.

Authors (4)

Lingyi Wang

Rashed Shelim

Walid Saad

Naren Ramakrishnan

Submitted

February 11, 2025

arXiv Category

cs.LG

arXiv PDF

Key Contributions

Proposes the Dual-Mind World Model (DMWM) framework, inspired by dual-process theory, which integrates an RSSM-based System 1 for intuitive transitions and a logic-integrated neural network System 2 for guided, logical imagination. This allows for long-term imagination with logical consistency, improving sample efficiency for learning long-horizon policies.

Business Value

Enables AI agents to learn more complex tasks and make better long-term decisions in dynamic environments, leading to more capable autonomous systems in robotics, logistics, and beyond.

Paper Metadata

Innovation Type

Architectural

Deployment Feasibility

The framework integrates existing components (RSSM, LINN) and introduces a novel interaction mechanism, making it potentially deployable in RL systems.

Limitations Addressed

Addresses the inability of existing RSSM-based world models to perform long-term imagination due to accumulated prediction errors, and the lack of logical consistency in such imagination processes.

Technical Tags

world modelsimaginationlong-horizon policysample efficiencyrecurrent state-space models (RSSM)dual-process theorylogical reasoningdual-mind world model (DMWM)system 1system 2LINN-S2

Research Topics

Reinforcement LearningWorld ModelsCognitive ArchitecturesSample EfficiencyLong-Term PlanningLogical Reasoning

Methods & Architectures

Dual-Mind World Model (DMWM) frameworkRSSM-based System 1 (RSSM-S1)Logic-integrated Neural Network-based System 2 (LINN-S2)Inter-system feedback mechanism Recurrent State-Space Model (RSSM)Logic-integrated Neural Network (LINN)Dual-Mind World Model (DMWM)

Applications & Tasks

Robotics Game Playing Autonomous Systems Reinforcement Learning Enabling long-term imagination in world modelsImproving sample efficiency in RLIntegrating logical reasoning with intuitive state transitionsOvercoming prediction error accumulation in RSSMs Learning long-horizon policiesPerforming long-term imaginationAgent decision-making under uncertaintySample-efficient reinforcement learning

Related Fields

Cognitive ScienceArtificial IntelligenceRoboticsMachine Learning Theory

Keywords

world modelimaginationreinforcement learningRSSMdual-process theorylogical reasoningDMWMsample efficiencylong-horizon policysystem 1system 2LINN

Academic Context

#Reinforcement Learning#World Models#Cognitive Architectures#Sample Efficiency#Long-Term Planning#Logical Reasoning

Commercial Potential

Potential Products

More advanced AI agents for complex tasksRobotic systems with enhanced planning capabilitiesSimulation environments for training RL agents

Target Industries

RoboticsAutonomous VehiclesGamingLogisticsAerospace

Use Case Examples

Training robots to perform complex manipulation tasks requiring foresightDeveloping game AI that can strategize over many turnsCreating autonomous systems that can plan long sequences of actions

Competitive Edge

Offers a novel cognitive architecture for world models that explicitly incorporates logical reasoning, potentially outperforming models that rely solely on statistical inference for long-term planning.

Resource Requirements

Compute Needs

Requires significant computational resources for training and running complex world models.

Data Requirements

Environments for reinforcement learning, potentially large datasets for training components.

Deployment Constraints

Integration into RL training pipelines and agent architectures.

Scalability

Scalability depends on the complexity of the environment and the size of the RSSM and LINN components.

Production Readiness

Maturity Level

Research Prototype

View Full Paper Back to Papers