Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 90% Match Research Paper AI Researchers,LLM Developers,Agent Developers,ML Engineers,Robotics Engineers 1 week ago

OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs

large-language-models › training-methods
📄 Abstract

Abstract: Agentic tool use has gained traction with the rise of agentic tool calling, yet most existing work overlooks the complexity of multi-turn tool interactions. We introduce OrchDAG, a synthetic data generation pipeline that models tool execution as directed acyclic graphs (DAGs) with controllable complexity. Using this dataset, we benchmark model performance and propose a graph-based reward to enhance RLVR training. Experiments show that the dataset presents a challenging but solvable benchmark, and the proposed reward is effective when combined with GRPO-style algorithms, highlighting the importance of leveraging topological structure and data complexity in multi-turn tool use.
Authors (3)
Yifu Lu
Shengjie Liu
Li Dong
Submitted
October 28, 2025
arXiv Category
cs.AI
arXiv PDF

Key Contributions

OrchDAG introduces a synthetic data generation pipeline that models tool execution as directed acyclic graphs (DAGs) to address the complexity of multi-turn tool interactions. It provides a challenging benchmark dataset and proposes a graph-based reward to enhance RLVR training, demonstrating effectiveness with GRPO-style algorithms.

Business Value

Enables the creation of more robust and capable AI agents that can handle complex, multi-step tasks involving various tools, leading to advanced automation and problem-solving capabilities.