arxiv_cv 95% Match Research Paper AI researchers in autonomous driving,Robotics engineers,RL practitioners,Simulation developers 2 weeks ago

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

reinforcement-learning › offline-rl

📄 Abstract

Abstract: Existing end-to-end autonomous driving (AD) algorithms typically follow the Imitation Learning (IL) paradigm, which faces challenges such as causal confusion and an open-loop gap. In this work, we propose RAD, a 3DGS-based closed-loop Reinforcement Learning (RL) framework for end-to-end Autonomous Driving. By leveraging 3DGS techniques, we construct a photorealistic digital replica of the real physical world, enabling the AD policy to extensively explore the state space and learn to handle out-of-distribution scenarios through large-scale trial and error. To enhance safety, we design specialized rewards to guide the policy in effectively responding to safety-critical events and understanding real-world causal relationships. To better align with human driving behavior, we incorporate IL into RL training as a regularization term. We introduce a closed-loop evaluation benchmark consisting of diverse, previously unseen 3DGS environments. Compared to IL-based methods, RAD achieves stronger performance in most closed-loop metrics, particularly exhibiting a 3x lower collision rate. Abundant closed-loop results are presented in the supplementary material. Code is available at https://github.com/hustvl/RAD for facilitating future research.

Authors (14)

Hao Gao

Shaoyu Chen

Bo Jiang

Bencheng Liao

Yiang Shi

Xiaoyang Guo

+8 more

Submitted

February 18, 2025

arXiv Category

cs.CV

arXiv PDF

Key Contributions

Introduces RAD, a closed-loop Reinforcement Learning framework for end-to-end autonomous driving, leveraging 3D Gaussian Splatting (3DGS) for photorealistic simulation. This allows for extensive state space exploration and learning to handle out-of-distribution scenarios, while specialized rewards enhance safety and IL regularization aligns with human behavior.

Business Value

Accelerates the development and validation of safer and more robust autonomous driving systems by enabling large-scale, realistic simulation-based training.

Paper Metadata

Innovation Type

Simulation-based RL Framework

Deployment Feasibility

Feasible for training and testing. Real-world deployment requires extensive validation and integration with vehicle hardware.

Limitations Addressed

Causal confusion and open-loop gap inherent in Imitation Learning, limited exploration, and difficulty in handling unseen scenarios in autonomous driving.

Technical Tags

end-to-end autonomous drivingreinforcement learning (RL)3DGS (3D Gaussian Splatting)closed-loop simulationimitation learning (IL)causal confusionopen-loop gapphotorealistic digital replicastate space explorationsafety-critical events

Research Topics

Autonomous DrivingReinforcement LearningSimulationImitation LearningRobotics

Methods & Architectures

RAD framework3D Gaussian Splatting (3DGS) for simulationClosed-loop Reinforcement LearningSpecialized rewards for safetyIL as regularization Reinforcement Learning Policy Network

Applications & Tasks

Autonomous Vehicles Robotics Simulation Environments Causal confusion and open-loop gap in Imitation Learning for ADLimited exploration of state spaceDifficulty handling out-of-distribution scenariosEnsuring safety in autonomous driving policies End-to-end autonomous driving policy trainingImproving safety and robustnessHandling out-of-distribution scenariosLearning causal relationships

Datasets & Benchmarks

Benchmarks

closed-loop evaluation benchmark consisting of diverse, previously unseen 3DGS environments

Related Fields

RoboticsComputer VisionReinforcement LearningSimulationAutonomous Systems

Keywords

autonomous drivingreinforcement learning3D Gaussian Splattingsimulationclosed-loopimitation learningroboticsdeep learningpolicy learningsafetyout-of-distributioncausal inferencedriving simulation

Academic Context

#Autonomous Driving#Reinforcement Learning#Simulation#Imitation Learning#Robotics

Commercial Potential

Potential Products

Autonomous driving simulation platformsRL training frameworks for ADTools for validating AD policies

Target Industries

AutomotiveTechnologyRoboticsSimulation Software

Use Case Examples

Training self-driving car policies in diverse and challenging simulated environmentsTesting AD systems for edge cases and safety-critical scenariosAccelerating the development cycle of autonomous vehicles

Competitive Edge

Offers a closed-loop RL approach using advanced 3DGS simulation, addressing limitations of traditional IL methods and enabling more robust and safer autonomous driving policies.

Market Opportunity

Massive market for autonomous driving technology.

Revenue Models

Licensing the AD software stackproviding simulation servicesdeveloping specialized AD components.

Resource Requirements

Compute Needs

Very High (for training RL policies in large-scale 3DGS simulations)

Data Requirements

3DGS scene data, potentially real-world driving data for IL regularization.

Deployment Constraints

Computational resources for simulation, transferability from simulation to real world (sim-to-real gap), safety validation.

Scalability

Scalability depends on the simulation environment's ability to generate diverse scenarios and the RL training infrastructure.

Regulatory Considerations

Safety standards for autonomous vehiclesdata privacy in simulation.

Production Readiness

Maturity Level

Research/Development

Time to Market

3-7 years (for full integration and validation in production vehicles)

Patent Potential

Moderate (novel framework and simulation integration)

View Full Paper Back to Papers