Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: World models have recently attracted growing interest in Multi-Agent
Reinforcement Learning (MARL) due to their ability to improve sample efficiency
for policy learning. However, accurately modeling environments in MARL is
challenging due to the exponentially large joint action space and highly
uncertain dynamics inherent in multi-agent systems. To address this, we reduce
modeling complexity by shifting from jointly modeling the entire state-action
transition dynamics to focusing on the state space alone at each timestep
through sequential agent modeling. Specifically, our approach enables the model
to progressively resolve uncertainty while capturing the structured
dependencies among agents, providing a more accurate representation of how
agents influence the state. Interestingly, this sequential revelation of
agents' actions in a multi-agent system aligns with the reverse process in
diffusion models--a class of powerful generative models known for their
expressiveness and training stability compared to autoregressive or latent
variable models. Leveraging this insight, we develop a flexible and robust
world model for MARL using diffusion models. Our method, Diffusion-Inspired
Multi-Agent world model (DIMA), achieves state-of-the-art performance across
multiple multi-agent control benchmarks, significantly outperforming prior
world models in terms of final return and sample efficiency, including MAMuJoCo
and Bi-DexHands. DIMA establishes a new paradigm for constructing multi-agent
world models, advancing the frontier of MARL research. Codes are open-sourced
at https://github.com/breez3young/DIMA.
Authors (8)
Yang Zhang
Xinran Li
Jianing Ye
Shuang Qiu
Delin Qu
Xiu Li
+2 more
Key Contributions
Proposes a diffusion-inspired approach to MARL world modeling that focuses on progressively modeling the state space rather than joint state-action dynamics. This method reduces complexity, progressively resolves uncertainty, and captures structured agent dependencies, aligning with the reverse process of diffusion models.
Business Value
Enhances the development of more capable multi-agent systems (e.g., swarms of robots, complex game AIs) by improving learning efficiency and robustness in uncertain environments, leading to better performance in collaborative tasks.