Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 95% Match Research Paper Reinforcement Learning Researchers,Generative AI Researchers,Robotics Engineers,AI Scientists 1 day ago

Bellman Diffusion Models

generative-ai › diffusion
📄 Abstract

Abstract: Diffusion models have seen tremendous success as generative architectures. Recently, they have been shown to be effective at modelling policies for offline reinforcement learning and imitation learning. We explore using diffusion as a model class for the successor state measure (SSM) of a policy. We find that enforcing the Bellman flow constraints leads to a simple Bellman update on the diffusion step distribution.
Authors (2)
Liam Schramm
Abdeslam Boularias
Submitted
July 16, 2024
arXiv Category
cs.LG
arXiv PDF

Key Contributions

This paper explores using diffusion models to represent the successor state measure (SSM) for policies in offline reinforcement learning and imitation learning. By enforcing Bellman flow constraints, they derive a simple Bellman update on the diffusion step distribution, integrating generative power with RL principles.

Business Value

Could lead to more robust and capable AI agents in domains like robotics and autonomous systems by improving how policies are learned and represented, especially from offline data.