Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 80% Match Research Paper AI researchers,ML engineers,Developers of multimodal AI systems,Researchers in LLMs and diffusion models 2 weeks ago

UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts

reinforcement-learning › multi-agent
📄 Abstract

Abstract: We present UniRL-Zero, a unified reinforcement learning (RL) framework that boosts, multimodal language model understanding and reasoning, diffusion model multimedia generation, and their beneficial interaction capabilities within a unified model. Our work defines six scenarios for unified model reinforcement learning, providing systematic baselines for reinforcement learning of unified understanding and generation model. Our code is available at https://github.com/G-U-N/UniRL.
Authors (5)
Fu-Yun Wang
Han Zhang
Michael Gharbi
Hongsheng Li
Taesung Park
Submitted
October 20, 2025
arXiv Category
cs.LG
arXiv PDF Code

Key Contributions

UniRL-Zero presents a unified reinforcement learning framework designed to enhance both multimodal language model understanding/reasoning and diffusion model multimedia generation. It defines six scenarios for unified model RL, providing systematic baselines for training models that excel in both understanding and generation tasks, fostering beneficial interactions between modalities.

Business Value

Paves the way for more versatile and powerful AI systems capable of both understanding complex information and generating rich multimedia content, leading to enhanced creative tools and more intelligent agents.

View Code on GitHub