Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Despite remarkable progress in driving world models, their potential for
autonomous systems remains largely untapped: the world models are mostly
learned for world simulation and decoupled from trajectory planning. While
recent efforts aim to unify world modeling and planning in a single framework,
the synergistic facilitation mechanism of world modeling for planning still
requires further exploration. In this work, we introduce a new driving paradigm
named Policy World Model (PWM), which not only integrates world modeling and
trajectory planning within a unified architecture, but is also able to benefit
planning using the learned world knowledge through the proposed action-free
future state forecasting scheme. Through collaborative state-action prediction,
PWM can mimic the human-like anticipatory perception, yielding more reliable
planning performance. To facilitate the efficiency of video forecasting, we
further introduce a dynamically enhanced parallel token generation mechanism,
equipped with a context-guided tokenizer and an adaptive dynamic focal loss.
Despite utilizing only front camera input, our method matches or exceeds
state-of-the-art approaches that rely on multi-view and multi-modal inputs.
Code and model weights will be released at
https://github.com/6550Zhao/Policy-World-Model.
Authors (5)
Zhida Zhao
Talas Fu
Yifan Wang
Lijun Wang
Huchuan Lu
Submitted
October 22, 2025
Key Contributions
This paper introduces the Policy World Model (PWM), a new driving paradigm that unifies world modeling and trajectory planning. PWM leverages learned world knowledge for planning through action-free future state forecasting and collaborative state-action prediction, enabling human-like anticipatory perception for more reliable planning.
Business Value
Enhances the safety and reliability of autonomous systems, particularly in complex dynamic environments like autonomous driving. This can lead to reduced accidents, improved traffic flow, and more efficient logistics.