Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Imitation Learning (IL) is a natural way for humans to teach robots,
particularly when high-quality demonstrations are easy to obtain. While IL has
been widely applied to single-robot settings, relatively few studies have
addressed the extension of these methods to multi-agent systems, especially in
settings where a single human must provide demonstrations to a team of
collaborating robots. In this paper, we introduce and study Round-Robin
Behavior Cloning (R2BC), a method that enables a single human operator to
effectively train multi-robot systems through sequential, single-agent
demonstrations. Our approach allows the human to teleoperate one agent at a
time and incrementally teach multi-agent behavior to the entire system, without
requiring demonstrations in the joint multi-agent action space. We show that
R2BC methods match, and in some cases surpass, the performance of an oracle
behavior cloning approach trained on privileged synchronized demonstrations
across four multi-agent simulated tasks. Finally, we deploy R2BC on two
physical robot tasks trained using real human demonstrations.
Authors (6)
Connor Mattson
Varun Raveendra
Ellen Novoseller
Nicholas Waytowich
Vernon J. Lawhern
Daniel S. Brown
Submitted
October 20, 2025
Key Contributions
R2BC (Round-Robin Behavior Cloning) is a novel method that enables effective multi-agent imitation learning using only sequential, single-agent demonstrations provided by a single human operator. This approach significantly simplifies the teaching process for multi-robot systems, as it avoids the need for demonstrations in the complex joint multi-agent action space and can match or surpass oracle performance.
Business Value
Reduces the cost and complexity of training multi-robot systems, making advanced automation more accessible for tasks requiring coordinated robot actions, such as in manufacturing, logistics, and exploration.