Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 95% Match Research Paper AI Researchers,Multi-Agent Systems Experts,Robotics Engineers,Game Developers 2 weeks ago

CooT: Learning to Coordinate In-Context with Coordination Transformers

reinforcement-learning › multi-agent
📄 Abstract

Abstract: Effective coordination among artificial agents in dynamic and uncertain environments remains a significant challenge in multi-agent systems. Existing approaches, such as self-play and population-based methods, either generalize poorly to unseen partners or require impractically extensive fine-tuning. To overcome these limitations, we propose Coordination Transformers (\coot), a novel in-context coordination framework that uses recent interaction histories to rapidly adapt to unseen partners. Unlike prior approaches that primarily aim to diversify training partners, \coot explicitly focuses on adapting to new partner behaviors by predicting actions aligned with observed interactions. Trained on trajectories collected from diverse pairs of agents with complementary preferences, \coot quickly learns effective coordination strategies without explicit supervision or parameter updates. Across diverse coordination tasks in Overcooked, \coot consistently outperforms baselines including population-based approaches, gradient-based fine-tuning, and a Meta-RL-inspired contextual adaptation method. Notably, fine-tuning proves unstable and ineffective, while Meta-RL struggles to achieve reliable coordination. By contrast, \coot achieves stable, rapid in-context adaptation and is consistently ranked the most effective collaborator in human evaluations.
Authors (5)
Huai-Chih Wang
Hsiang-Chun Chuang
Hsi-Chun Cheng
Dai-Jie Wu
Shao-Hua Sun
Submitted
June 30, 2025
arXiv Category
cs.AI
arXiv PDF

Key Contributions

Introduces Coordination Transformers (CooT), a novel in-context coordination framework that uses recent interaction histories to rapidly adapt to unseen partners without explicit supervision or parameter updates. CooT explicitly focuses on adapting to new partner behaviors by predicting actions aligned with observed interactions, overcoming limitations of poor generalization and extensive fine-tuning.

Business Value

Enables the development of more adaptable and collaborative AI agents for applications like multi-robot systems, autonomous teams, and human-AI collaboration tools, improving efficiency and task success.