Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 95% Match Research Paper Robotics Researchers,AI Engineers,Control Systems Engineers,Industrial Automation Specialists 1 week ago

Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills

robotics › manipulation
📄 Abstract

Abstract: Long-horizon contact-rich bimanual manipulation presents a significant challenge, requiring complex coordination involving a mixture of parallel execution and sequential collaboration between arms. In this paper, we introduce a hierarchical framework that frames this challenge as an integrated skill planning & scheduling problem, going beyond purely sequential decision-making to support simultaneous skill invocation. Our approach is built upon a library of single-arm and bimanual primitive skills, each trained using Reinforcement Learning (RL) in GPU-accelerated simulation. We then train a Transformer-based planner on a dataset of skill compositions to act as a high-level scheduler, simultaneously predicting the discrete schedule of skills as well as their continuous parameters. We demonstrate that our method achieves higher success rates on complex, contact-rich tasks than end-to-end RL approaches and produces more efficient, coordinated behaviors than traditional sequential-only planners.
Authors (4)
Weikang Wan
Fabio Ramos
Xuning Yang
Caelan Garrett
Submitted
October 29, 2025
arXiv Category
cs.RO
arXiv PDF

Key Contributions

Introduces a hierarchical framework for long-horizon bimanual manipulation that integrates skill planning and scheduling, enabling simultaneous skill invocation. It uses RL to train primitive skills and a Transformer planner to schedule discrete skills and continuous parameters, achieving higher success rates and more efficient coordination than existing methods.

Business Value

Enables more sophisticated and efficient robotic automation for complex assembly and manipulation tasks in manufacturing and logistics, potentially increasing productivity and reducing errors.