Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Vision-Language-Action (VLA) models have recently emerged as powerful
general-purpose policies for robotic manipulation, benefiting from large-scale
multi-modal pre-training. However, they often fail to generalize reliably in
out-of-distribution deployments, where unavoidable disturbances such as
observation noise, sensor errors, or actuation perturbations become prevalent.
While recent Reinforcement Learning (RL)-based post-training provides a
practical means to adapt pre-trained VLA models, existing methods mainly
emphasize reward maximization and overlook robustness to environmental
uncertainty. In this work, we introduce RobustVLA, a lightweight online RL
post-training method designed to explicitly enhance the resilience of VLA
models. Through a systematic robustness analysis, we identify two key
regularizations: Jacobian regularization, which mitigates sensitivity to
observation noise, and smoothness regularization, which stabilizes policies
under action perturbations. Extensive experiments across diverse robotic
environments demonstrate that RobustVLA significantly outperforms prior
state-of-the-art methods in robustness and reliability. Our results highlight
the importance of principled robustness-aware RL post-training as a key step
toward improving the reliability and robustness of VLA models.
Authors (6)
Hongyin Zhang
Shuo Zhang
Junxi Jin
Qixin Zeng
Runze Li
Donglin Wang
Submitted
November 3, 2025
Key Contributions
Introduces RobustVLA, a lightweight online RL post-training method that explicitly enhances the resilience of Vision-Language-Action (VLA) models. It addresses the generalization failures of VLA models in out-of-distribution deployments by incorporating Jacobian and smoothness regularization to mitigate sensitivity to observation noise and stabilize policies under action perturbations, which is crucial for reliable robotic manipulation.
Business Value
Enhances the reliability and safety of robots operating in real-world, unpredictable environments, leading to more robust and dependable robotic manipulation systems in industries like manufacturing and logistics.