Research Paper · RL Researchers, Robotics Engineers, AI Researchers, Cognitive Scientists

Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning

📄 Abstract

Humans leverage rich internal models of the world to reason about the future, imagine counterfactuals, and adapt flexibly to new situations. In Reinforcement Learning (RL), world models aim to capture how the environment evolves in response to the agent's actions, facilitating planning and generalization. However, typical world models directly operate on the environment variables (e.g. pixels, physical attributes), which can make their training slow and cumbersome; instead, it may be advantageous to rely on high-level latent dimensions that capture relevant multimodal variables. Global Workspace (GW) Theory offers a cognitive framework for multimodal integration and information broadcasting in the brain, and recent studies have begun to introduce efficient deep learning implementations of GW. Here, we evaluate the capabilities of an RL system combining GW with a world model. We compare our GW-Dreamer with various versions of the standard PPO and the original Dreamer algorithms. We show that performing the dreaming process (i.e., mental simulation) inside the GW latent space allows for training with fewer environment steps. As an additional emergent property, the resulting model (but not its comparison baselines) displays strong robustness to the absence of one of its observation modalities (images or simulation attributes). We conclude that the combination of GW with World Models holds great potential for improving decision-making in RL agents.

Authors (3)

Léopold Maytié

Roland Bertin Johannet

Rufin VanRullen

Submitted

February 28, 2025

arXiv Category

cs.AI

Key Contributions

Introduces GW-Dreamer, an RL system that combines a Global Workspace (GW) with a world model. The dreaming process (mental simulation) runs inside the GW latent space, a high-level multimodal representation, rather than directly on environment variables such as pixels or physical attributes. Compared with various versions of standard PPO and the original Dreamer, GW-Dreamer trains with fewer environment steps and, unlike those baselines, stays robust when one observation modality (images or simulation attributes) is absent.
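
The idea of dreaming inside a shared multimodal latent space can be illustrated with a toy sketch. The code below is not the authors' architecture: the class `ToyWorkspaceDreamer`, its random linear encoders, the averaging fusion rule, and all dimensions are assumptions chosen only to show the control flow (fuse the available modalities into one latent, then roll a learned dynamics model forward without querying the environment).

```python
import random

LATENT_DIM = 4  # hypothetical size of the shared latent space


def make_matrix(rows, cols, rng):
    """Random weights standing in for trained parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyWorkspaceDreamer:
    """Hypothetical toy model; illustrates the control flow only."""

    def __init__(self, image_dim, attr_dim, action_dim, seed=0):
        rng = random.Random(seed)
        self.enc_image = make_matrix(LATENT_DIM, image_dim, rng)
        self.enc_attr = make_matrix(LATENT_DIM, attr_dim, rng)
        self.dynamics = make_matrix(LATENT_DIM, LATENT_DIM + action_dim, rng)
        self.reward_head = make_matrix(1, LATENT_DIM, rng)

    def fuse(self, image=None, attrs=None):
        """Average the encodings of whichever modalities are present."""
        parts = []
        if image is not None:
            parts.append(matvec(self.enc_image, image))
        if attrs is not None:
            parts.append(matvec(self.enc_attr, attrs))
        if not parts:
            raise ValueError("at least one modality is required")
        return [sum(values) / len(parts) for values in zip(*parts)]

    def step(self, latent, action):
        """Predict the next latent and reward without touching the environment."""
        next_latent = matvec(self.dynamics, latent + action)
        reward = matvec(self.reward_head, next_latent)[0]
        return next_latent, reward

    def dream(self, latent, policy, horizon):
        """Roll out `horizon` imagined steps entirely in latent space."""
        trajectory = []
        for _ in range(horizon):
            action = policy(latent)
            latent, reward = self.step(latent, action)
            trajectory.append((latent, action, reward))
        return trajectory


# Usage: the same latent interface works with both modalities or only one.
model = ToyWorkspaceDreamer(image_dim=6, attr_dim=3, action_dim=2)
image = [0.1] * 6
attrs = [0.2, -0.1, 0.3]
policy = lambda latent: [1.0, 0.0]  # fixed action, stands in for an actor

z_both = model.fuse(image=image, attrs=attrs)
z_attr_only = model.fuse(attrs=attrs)
rollout = model.dream(z_both, policy, horizon=5)
```

Because the dynamics model and the policy only ever see the fused latent, the imagined rollout has the same shape whether the latent came from both modalities or from a single one. Whether a trained system is actually robust to a missing modality depends on how the latent space is learned, which this sketch does not model.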

Business Value

Enables the development of more sophisticated RL agents capable of complex planning and adaptation, crucial for advanced robotics, autonomous vehicles, and intelligent decision-making systems.

Paper Metadata

Innovation Type

Integration of Cognitive Theory with RL World Models

Deployment Feasibility

Requires significant computational resources for training complex world models and GW architectures. Feasibility depends on hardware and algorithmic efficiency improvements.

Limitations Addressed

Addresses the challenges of slow and cumbersome training of traditional world models by utilizing high-level latent dimensions and a cognitive architecture (GW) for more efficient multimodal integration and information processing.

Performance Gains

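In the paper's comparisons, GW-Dreamer trained with fewer environment steps than various versions of PPO and the original Dreamer. Unlike those baselines, it also stayed robust when one observation modality (images or simulation attributes) was absent.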

Technical Tags

world models, reinforcement learning, Global Workspace (GW) Theory, multimodal integration, planning, generalization, latent dimensions, deep learning, imagination, counterfactuals

Research Topics

Reinforcement Learning, World Models, Cognitive Architectures, Deep Learning, Artificial Intelligence

Methods & Architectures

Global Workspace (GW) integration with world model, Comparison with PPO and Dreamer algorithms, GW-Dreamer

Applications & Tasks

Robotics, Autonomous Systems, Game AI, Simulation, Slow and cumbersome training of world models, Leveraging high-level latent dimensions, Improving planning and generalization in RL, Learning world models, Enabling planning and imagination in RL agents, Multimodal information integration

Related Fields

Artificial Intelligence, Reinforcement Learning, Cognitive Science, Robotics, Machine Learning

Keywords

reinforcement learning, world models, Global Workspace Theory, multimodal, planning, generalization, latent dimensions, cognitive architecture, imagination, GW-Dreamer, RL

Academic Context

#Reinforcement Learning #World Models #Cognitive Architectures #Deep Learning #Artificial Intelligence

Commercial Potential

Potential Products

Advanced autonomous systems, Robots with enhanced planning capabilities, AI agents for complex simulations

Target Industries

Robotics, Automotive, Aerospace, Gaming, Logistics

Use Case Examples

Training robots to perform complex manipulation tasks; Developing autonomous vehicles that can plan long-term routes; Creating AI agents that can adapt to dynamic environments

Competitive Edge

Integrates principles from cognitive science (GW Theory) into RL world models, offering a potentially more efficient and capable approach to planning and generalization compared to existing methods like PPO and Dreamer.

Resource Requirements

Compute Needs

High compute requirements for training complex world models and RL agents.

Data Requirements

Requires interaction data from environments (simulated or real) for training.

Deployment Constraints

Computational cost, sample efficiency, and the complexity of integrating GW theory effectively.

Scalability

Scalability depends on the efficiency of the world model and GW implementation.
