arxiv_ai 95% Match Research Paper Robotics Researchers,ML Engineers,AI Researchers in Manipulation,Autonomous Systems Developers 2 weeks ago

RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

robotics › manipulation

📄 Abstract

Abstract: Vision-Language-Action models (VLAs) have demonstrated remarkable performance on complex robotic manipulation tasks through imitation learning. However, existing imitation learning datasets contain only successful trajectories and lack failure or recovery data, especially for out-of-distribution (OOD) states where the robot deviates from the main policy due to minor perturbations or errors, leading VLA models to struggle with states deviating from the training distribution. To this end, we propose an automated OOD data augmentation framework named RESample through exploratory sampling. Specifically, we first leverage offline reinforcement learning to obtain an action-value network that accurately identifies sub-optimal actions under the current manipulation policy. We further sample potential OOD states from trajectories via rollout, and design an exploratory sampling mechanism that adaptively incorporates these action proxies into the training dataset to ensure efficiency. Subsequently, our framework explicitly encourages the VLAs to recover from OOD states and enhances their robustness against distributional shifts. We conduct extensive experiments on the LIBERO benchmark as well as real-world robotic manipulation tasks, demonstrating that RESample consistently improves the stability and generalization ability of VLA models.

Authors (8)

Yuquan Xue

Guanxing Lu

Zhenyu Wu

Chuanrui Zhang

Bofang Jia

Zhengyi Gu

+2 more

Submitted

October 20, 2025

arXiv Category

cs.RO

arXiv PDF

Key Contributions

Proposes RESample, an automated Out-of-Distribution (OOD) data augmentation framework for robotic manipulation that uses exploratory sampling. By leveraging offline RL to identify sub-optimal actions and adaptively incorporating sampled OOD states, RESample enhances the robustness of Vision-Language-Action models to deviations from the training distribution.

Business Value

Enables the development of more reliable and adaptable robots that can handle unexpected situations and recover from errors, crucial for safe and efficient deployment in real-world environments.

Paper Metadata

Innovation Type

Methodological

Deployment Feasibility

Moderate to High. Requires integration into the robot's learning pipeline and potentially access to simulation or real-world rollouts. The offline RL component adds complexity.

Limitations Addressed

Addresses the critical limitation of imitation learning datasets lacking failure and recovery data, which causes VLA models to fail in out-of-distribution states. RESample generates relevant OOD data to improve model robustness.

Performance Gains

Enhances robustness of VLA models to OOD states; improves performance on states deviating from the training distribution.

Technical Tags

data augmentationrobotic manipulationvision-language-action modelsimitation learningout-of-distribution (OOD) statesexploratory samplingoffline reinforcement learningaction-value networkrobustnessfailure recovery

Research Topics

RoboticsImitation LearningReinforcement LearningData AugmentationAI Robustness

Methods & Architectures

RESample frameworkExploratory samplingOffline reinforcement learningAction-value networkTrajectory samplingAdaptive incorporation of action proxies Vision-Language-Action (VLA) modelsAction-value network

Applications & Tasks

Robotics Industrial Automation Autonomous Systems Lack of failure/recovery data in imitation learning datasetsVLA model struggles with OOD statesImproving robustness to perturbations and errorsEfficient OOD data augmentation Robotic manipulationImitation learning for complex tasksImproving VLA model generalization

Related Fields

RoboticsImitation LearningReinforcement LearningComputer VisionMachine LearningAI Safety

Keywords

roboticsmanipulationdata augmentationOODimitation learningVLAoffline RLrobustnessexploratory samplingfailure recovery

Academic Context

#Robotics#Imitation Learning#Reinforcement Learning#Data Augmentation#AI Robustness

Commercial Potential

Potential Products

Robotic control software with enhanced OOD handlingSimulation environments for training robust robotsData augmentation services for robotics

Target Industries

RoboticsManufacturingLogisticsAutonomous VehiclesAerospace

Use Case Examples

Training a robot arm to pick and place objects even when the object is slightly displaced or partially occluded.Enabling a robot to recover from a dropped item or an unexpected collision.Improving the generalization of robots in unstructured home or warehouse environments.

Competitive Edge

Offers a novel, automated approach to data augmentation specifically for OOD states in robotic manipulation, addressing a key bottleneck in imitation learning for robust real-world deployment.

Market Opportunity

Large and growing market for advanced robotics and automation.

Revenue Models

Licensing of RESample frameworkintegration services for robotic companies.

Resource Requirements

Compute Needs

High (for offline RL training and extensive rollouts/sampling)

Data Requirements

Existing imitation learning datasets, potentially simulation environments for generating rollouts.

Deployment Constraints

Real-time performance for OOD detection and recovery,Integration with robot control systems,Safety considerations during exploratory sampling

Scalability

Scalability depends on the efficiency of the offline RL agent and the sampling strategy.

Regulatory Considerations

Safety standards for autonomous systems and robotics.

Production Readiness

Maturity Level

Research

Time to Market

2-5 years (requires integration and validation in robotic systems)

View Full Paper Back to Papers