arxiv_ai 95% Match Research Paper AI Researchers,ML Engineers,Developers of Autonomous Systems,Robotics Engineers 1 week ago

GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning

large-language-models › reasoning

📄 Abstract

Abstract: Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential reasoning and execution, failing to exploit the inherent parallelism among independent sub-tasks. This sequential bottleneck leads to inefficient tool utilization and suboptimal performance in multi-step reasoning scenarios. We introduce Graph-based Agent Planning (GAP), a novel framework that explicitly models inter-task dependencies through graph-based planning to enable adaptive parallel and serial tool execution. Our approach trains agent foundation models to decompose complex tasks into dependency-aware sub-task graphs, autonomously determining which tools can be executed in parallel and which must follow sequential dependencies. This dependency-aware orchestration achieves substantial improvements in both execution efficiency and task accuracy. To train GAP, we construct a high-quality dataset of graph-based planning traces derived from the Multi-Hop Question Answering (MHQA) benchmark. We employ a two-stage training strategy: supervised fine-tuning (SFT) on the curated dataset, followed by reinforcement learning (RL) with a correctness-based reward function on strategically sampled queries where tool-based reasoning provides maximum value. Experimental results on MHQA datasets demonstrate that GAP significantly outperforms traditional ReAct baselines, particularly on multi-step retrieval tasks, while achieving dramatic improvements in tool invocation efficiency through intelligent parallelization. The project page is available at: https://github.com/WJQ7777/Graph-Agent-Planning.

Authors (7)

Jiaqi Wu

Qinlao Zhao

Zefeng Chen

Kai Qin

Yifei Zhao

Xueqian Wang

+1 more

Submitted

October 29, 2025

arXiv Category

cs.AI

arXiv PDF

Key Contributions

GAP (Graph-based Agent Planning) is a novel framework that enables autonomous agents powered by LLMs to exploit parallelism in complex task-solving. It explicitly models inter-task dependencies using graphs, allowing for adaptive parallel and serial tool execution, thereby overcoming the sequential bottleneck of existing paradigms like ReAct and significantly improving efficiency and accuracy.

Business Value

Enables the development of more efficient and capable autonomous agents for complex tasks, leading to increased productivity in areas like automated customer service, complex data analysis, and potentially robotic operations.

Paper Metadata

Innovation Type

Algorithmic Framework

Deployment Feasibility

Moderate. Requires sophisticated planning and orchestration capabilities, and integration with LLM foundation models and tool APIs.

Limitations Addressed

Addresses the inefficiency and suboptimal performance of existing LLM agent paradigms (like ReAct) that rely on sequential reasoning and execution, failing to leverage parallelism among independent sub-tasks.

Performance Gains

Substantial improvements in both execution efficiency and task accuracy compared to sequential paradigms.

Technical Tags

Autonomous AgentsLLM Tool UseGraph-based PlanningParallel ExecutionReinforcement LearningTask DecompositionDependency AwarenessAgent Foundation Models

Research Topics

Autonomous Agent PlanningLLM Reasoning and Tool UseTask ParallelizationReinforcement Learning for AgentsAI Planning

Methods & Architectures

Graph-based planningParallel and serial tool executionReinforcement Learning (RL)Task decompositionDependency-aware orchestration Graph-based Agent Planning (GAP)Agent Foundation Models

Applications & Tasks

Autonomous Systems Complex Task Solving Robotics (potential) Inefficient sequential reasoning and execution in LLM agentsSuboptimal performance in multi-step reasoningUnderutilization of parallelizable sub-tasks Complex Task SolvingAgent PlanningTool UseReasoning

Related Fields

Artificial IntelligenceMachine LearningAutonomous SystemsPlanning and SchedulingNatural Language Processing

Keywords

Autonomous AgentsLarge Language ModelsPlanningTool UseGraph Neural NetworksParallel ProcessingReinforcement LearningTask DecompositionAIMachine Learning

Academic Context

#Autonomous Agent Planning#LLM Reasoning and Tool Use#Task Parallelization#Reinforcement Learning for Agents#AI Planning

Commercial Potential

Potential Products

Advanced autonomous agents for complex workflowsAI-powered task automation platformsIntelligent assistants capable of parallel operations

Target Industries

TechnologySoftware DevelopmentCustomer ServiceResearch & DevelopmentRobotics

Use Case Examples

Automating complex multi-step processesCoordinating multiple AI tools simultaneouslyDeveloping more efficient AI agents for research and development tasks

Competitive Edge

Offers a significant advantage over sequential agent planning methods by introducing graph-based dependency modeling and enabling parallel tool execution.

Market Opportunity

Large (AI agents, automation market)

Revenue Models

Platform licensingAPI accessspecialized agent development services.

Resource Requirements

Compute Needs

High (for training agent foundation models and executing complex plans)

Data Requirements

Requires datasets for training agent planning models, potentially involving complex tasks with sub-task dependencies.

Deployment Constraints

Complexity of the planning graph, LLM inference latency, tool integration.

Scalability

Scalability depends on the complexity of the task graphs and the efficiency of the planning algorithm.

Production Readiness

Maturity Level

Research

Time to Market

2-4 years (for integration into agent platforms)

Patent Potential

Moderate (novel planning framework)

View Full Paper Back to Papers