Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 95% Match Research Paper AI Researchers,ML Engineers,Developers of multi-agent LLM systems 2 weeks ago

Multi-Agent Collaboration via Evolving Orchestration

large-language-models › reasoning
📄 Abstract

Abstract: Large language models (LLMs) have achieved remarkable results across diverse downstream tasks, but their monolithic nature restricts scalability and efficiency in complex problem-solving. While recent research explores multi-agent collaboration among LLMs, most approaches rely on static organizational structures that struggle to adapt as task complexity and agent numbers grow, resulting in coordination overhead and inefficiencies. To this end, we propose a puppeteer-style paradigm for LLM-based multi-agent collaboration, where a centralized orchestrator ("puppeteer") dynamically directs agents ("puppets") in response to evolving task states. This orchestrator is trained via reinforcement learning to adaptively sequence and prioritize agents, enabling flexible and evolvable collective reasoning. Experiments on closed- and open-domain scenarios show that this method achieves superior performance with reduced computational costs. Analyses further reveal that the key improvements consistently stem from the emergence of more compact, cyclic reasoning structures under the orchestrator's evolution. Our code is available at https://github.com/OpenBMB/ChatDev/tree/puppeteer.
Authors (14)
Yufan Dang
Chen Qian
Xueheng Luo
Jingru Fan
Zihao Xie
Ruijie Shi
+8 more
Submitted
May 26, 2025
arXiv Category
cs.CL
arXiv PDF

Key Contributions

Proposes a puppeteer-style paradigm for LLM multi-agent collaboration where a centralized orchestrator dynamically directs agents. This approach uses reinforcement learning to adaptively sequence and prioritize agents, enabling flexible and evolvable collective reasoning, which outperforms static structures in complex scenarios with reduced computational costs.

Business Value

Enables more efficient and scalable deployment of LLM-based systems for complex tasks, potentially reducing operational costs and improving service quality in areas like customer support or complex data analysis.