Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 95% Match Research Paper Robotics Researchers,AI Engineers,ML Practitioners 1 day ago

RobustVLA: Robustness-Aware Reinforcement Post-Training for Vision-Language-Action Models

robotics › manipulation
📄 Abstract

Abstract: Vision-Language-Action (VLA) models have recently emerged as powerful general-purpose policies for robotic manipulation, benefiting from large-scale multi-modal pre-training. However, they often fail to generalize reliably in out-of-distribution deployments, where unavoidable disturbances such as observation noise, sensor errors, or actuation perturbations become prevalent. While recent Reinforcement Learning (RL)-based post-training provides a practical means to adapt pre-trained VLA models, existing methods mainly emphasize reward maximization and overlook robustness to environmental uncertainty. In this work, we introduce RobustVLA, a lightweight online RL post-training method designed to explicitly enhance the resilience of VLA models. Through a systematic robustness analysis, we identify two key regularizations: Jacobian regularization, which mitigates sensitivity to observation noise, and smoothness regularization, which stabilizes policies under action perturbations. Extensive experiments across diverse robotic environments demonstrate that RobustVLA significantly outperforms prior state-of-the-art methods in robustness and reliability. Our results highlight the importance of principled robustness-aware RL post-training as a key step toward improving the reliability and robustness of VLA models.
Authors (6)
Hongyin Zhang
Shuo Zhang
Junxi Jin
Qixin Zeng
Runze Li
Donglin Wang
Submitted
November 3, 2025
arXiv Category
cs.RO
arXiv PDF

Key Contributions

Introduces RobustVLA, a lightweight online RL post-training method that explicitly enhances the resilience of Vision-Language-Action (VLA) models. It addresses the generalization failures of VLA models in out-of-distribution deployments by incorporating Jacobian and smoothness regularization to mitigate sensitivity to observation noise and stabilize policies under action perturbations, which is crucial for reliable robotic manipulation.

Business Value

Enhances the reliability and safety of robots operating in real-world, unpredictable environments, leading to more robust and dependable robotic manipulation systems in industries like manufacturing and logistics.