Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Long-horizon contact-rich bimanual manipulation presents a significant
challenge, requiring complex coordination involving a mixture of parallel
execution and sequential collaboration between arms. In this paper, we
introduce a hierarchical framework that frames this challenge as an integrated
skill planning & scheduling problem, going beyond purely sequential
decision-making to support simultaneous skill invocation. Our approach is built
upon a library of single-arm and bimanual primitive skills, each trained using
Reinforcement Learning (RL) in GPU-accelerated simulation. We then train a
Transformer-based planner on a dataset of skill compositions to act as a
high-level scheduler, simultaneously predicting the discrete schedule of skills
as well as their continuous parameters. We demonstrate that our method achieves
higher success rates on complex, contact-rich tasks than end-to-end RL
approaches and produces more efficient, coordinated behaviors than traditional
sequential-only planners.
Authors (4)
Weikang Wan
Fabio Ramos
Xuning Yang
Caelan Garrett
Submitted
October 29, 2025
Key Contributions
Introduces a hierarchical framework for long-horizon bimanual manipulation that integrates skill planning and scheduling, enabling simultaneous skill invocation. It uses RL to train primitive skills and a Transformer planner to schedule discrete skills and continuous parameters, achieving higher success rates and more efficient coordination than existing methods.
Business Value
Enables more sophisticated and efficient robotic automation for complex assembly and manipulation tasks in manufacturing and logistics, potentially increasing productivity and reducing errors.