arxiv_cl 91% Match Research Paper AI Researchers,LLM Developers,Machine Learning Engineers,AI Agent Developers 1 week ago

ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers

large-language-models › model-architecture

📄 Abstract

Abstract: Tool calling has become increasingly popular for Large Language Models (LLMs). However, for large tool sets, the resulting tokens would exceed the LLM's context window limit, making it impossible to include every tool. Hence, an external retriever is used to provide LLMs with the most relevant tools for a query. Existing retrieval models rank tools based on the similarity between a user query and a tool description (TD). This leads to suboptimal retrieval as user requests are often poorly aligned with the language of TD. To remedy the issue, we propose ToolDreamer, a framework to condition retriever models to fetch tools based on hypothetical (synthetic) TD generated using an LLM, i.e., description of tools that the LLM feels will be potentially useful for the query. The framework enables a more natural alignment between queries and tools within the language space of TD's. We apply ToolDreamer on the ToolRet dataset and show that our method improves the performance of sparse and dense retrievers with and without training, thus showcasing its flexibility. Through our proposed framework, our aim is to offload a portion of the reasoning burden to the retriever so that the LLM may effectively handle a large collection of tools without inundating its context window.

Authors (7)

Saptarshi Sengupta

Zhengyu Zhou

Jun Araki

Xingbo Wang

Bingqing Wang

Suhang Wang

+1 more

Submitted

October 22, 2025

arXiv Category

cs.CL

arXiv PDF

Key Contributions

Proposes ToolDreamer, a framework that conditions retriever models to fetch tools based on hypothetical, LLM-generated descriptions. This improves the alignment between user queries and tools, leading to better retrieval performance compared to traditional methods relying solely on static tool descriptions.

Business Value

Enhances the capabilities of AI agents and LLMs by enabling them to more effectively discover and utilize external tools, leading to more powerful and versatile AI applications.

Paper Metadata

Innovation Type

Algorithmic Improvement

Deployment Feasibility

Moderate. Requires integration of the retriever with LLM and tool execution environments.

Limitations Addressed

Suboptimal tool retrieval caused by poor alignment between user query language and static tool descriptions, especially for large tool sets.

Performance Gains

Improved tool retrieval accuracy,Better alignment between queries and tools

Technical Tags

tool callingLLM reasoningtool retrievalretriever modelssynthetic descriptionsquery-tool alignmentsparse retrievalToolRet dataset

Research Topics

Natural Language ProcessingMachine LearningArtificial IntelligenceTool Use in LLMs

Methods & Architectures

Synthetic data generationLLM-conditioned retriever trainingDescription generationSparse retrieval optimization Large Language Models (LLMs)Retriever ModelsSparse Retrieval Models

Applications & Tasks

AI Agents Task Automation Information Retrieval Suboptimal tool retrievalMisalignment between user queries and tool descriptionsHandling large tool sets Retrieving relevant tools for LLM queriesImproving LLM reasoning through better tool access

Datasets & Benchmarks

Datasets

ToolRet dataset

Benchmarks

Improved performance of sparse retrieval models

Related Fields

Artificial IntelligenceMachine LearningSoftware EngineeringInformation Retrieval

Keywords

tool callingLLMretrievaltool useAI agentsreasoningsynthetic datasparse retrievalframeworkNLP

Academic Context

#Natural Language Processing#Machine Learning#Artificial Intelligence#Tool Use in LLMs

Commercial Potential

Potential Products

Enhanced AI agents with better tool utilizationLibraries for tool retrieval optimizationFrameworks for LLM tool integration

Target Industries

TechnologySoftware DevelopmentAI ServicesAutomation

Use Case Examples

An AI agent using a calculator tool more effectivelyA chatbot retrieving and using relevant APIs for user requests

Competitive Edge

Offers a novel approach to tool retrieval by leveraging LLM-generated synthetic descriptions, addressing a key limitation of existing methods that rely on static tool descriptions.

Market Opportunity

Growing demand for capable AI agents and LLM applications.

Revenue Models

Licensing the retrieval technology or offering AI agent development services.

Resource Requirements

Compute Needs

Moderate to High (for training retriever and LLM)

Data Requirements

Tool descriptions, user queries, and potentially synthetic descriptions.

Deployment Constraints

Integration complexity with existing LLM and tool execution systems.

Scalability

The framework is designed to handle large tool sets by improving retrieval efficiency.

Production Readiness

Maturity Level

Research/Development

Time to Market

12-24 months for robust integration.

View Full Paper Back to Papers