Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 95% Match Research Paper Robotics Researchers,AI Engineers,Human-Robot Interaction Specialists,Automation System Integrators 2 weeks ago

R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

robotics › human-robot-interaction
📄 Abstract

Abstract: Imitation Learning (IL) is a natural way for humans to teach robots, particularly when high-quality demonstrations are easy to obtain. While IL has been widely applied to single-robot settings, relatively few studies have addressed the extension of these methods to multi-agent systems, especially in settings where a single human must provide demonstrations to a team of collaborating robots. In this paper, we introduce and study Round-Robin Behavior Cloning (R2BC), a method that enables a single human operator to effectively train multi-robot systems through sequential, single-agent demonstrations. Our approach allows the human to teleoperate one agent at a time and incrementally teach multi-agent behavior to the entire system, without requiring demonstrations in the joint multi-agent action space. We show that R2BC methods match, and in some cases surpass, the performance of an oracle behavior cloning approach trained on privileged synchronized demonstrations across four multi-agent simulated tasks. Finally, we deploy R2BC on two physical robot tasks trained using real human demonstrations.
Authors (6)
Connor Mattson
Varun Raveendra
Ellen Novoseller
Nicholas Waytowich
Vernon J. Lawhern
Daniel S. Brown
Submitted
October 20, 2025
arXiv Category
cs.RO
arXiv PDF

Key Contributions

R2BC (Round-Robin Behavior Cloning) is a novel method that enables effective multi-agent imitation learning using only sequential, single-agent demonstrations provided by a single human operator. This approach significantly simplifies the teaching process for multi-robot systems, as it avoids the need for demonstrations in the complex joint multi-agent action space and can match or surpass oracle performance.

Business Value

Reduces the cost and complexity of training multi-robot systems, making advanced automation more accessible for tasks requiring coordinated robot actions, such as in manufacturing, logistics, and exploration.