
MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL


📄 Abstract

Translating natural language to SQL remains difficult for complex queries. Such queries often need environmental interaction and self-correction. To address this, we introduce MARS-SQL, a novel multi-agent framework that combines principled task decomposition and interactive reinforcement learning (RL). Our system comprises three specialized agents: a Grounding Agent for schema linking, a Generation Agent for query generation, and a Validation Agent for final selection. The core of our framework is the Generation Agent, which is trained via a multi-turn RL policy. Adopting a ReAct-style Think-Act-Observe loop, the agent iteratively generates thoughts, executes SQL actions against a live database, and revises its strategy based on execution feedback, enabling dynamic, stateful reasoning and self-correction. At inference time, we generate multiple interaction trajectories to explore diverse reasoning paths. The Validation Agent then selects the optimal trajectory by modeling verification as a next-token prediction task and choosing the solution with the highest generation probability. This structured workflow pipelines specialized agents. It combines interactive RL for generation with generative modeling for verification. The approach proves highly effective for robust and accurate SQL generation. Experiments show that MARS-SQL achieves state-of-the-art Execution Accuracy of 77.84% on the BIRD dev set and 89.75% on the Spider test set. Our code is available at https://github.com/YangHaolin0526/MARS-SQL.
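The Think-Act-Observe loop described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `stub_policy` is a hypothetical hard-coded stand-in for the trained Generation Agent, the `sales` table is invented, and stopping at the first query that executes without error is a simplification rather than the paper's actual termination rule.

```python
import sqlite3


def make_db():
    """Build a tiny in-memory database (invented schema, for illustration only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, quarter TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?, ?)",
        [("North", "Q3", 120.0), ("South", "Q3", 80.0), ("North", "Q2", 50.0)],
    )
    return conn


def stub_policy(question, history):
    """Hypothetical stand-in for the trained agent: returns (thought, sql)."""
    if not history:
        # First attempt deliberately uses a column that does not exist.
        return (
            "Sum revenue per region for Q3.",
            "SELECT region, SUM(revenue) FROM sales "
            "WHERE quarter = 'Q3' GROUP BY region",
        )
    # Later attempts react to the error message observed in the last turn.
    return (
        "The previous query failed; the column is named amount.",
        "SELECT region, SUM(amount) FROM sales "
        "WHERE quarter = 'Q3' GROUP BY region ORDER BY region",
    )


def execute(conn, sql):
    """Act: run the SQL and return (ok, rows or error message)."""
    try:
        return True, conn.execute(sql).fetchall()
    except sqlite3.Error as exc:
        return False, str(exc)


def think_act_observe(conn, question, policy, max_turns=3):
    """Run the loop and return the full interaction trajectory."""
    history = []
    for _ in range(max_turns):
        thought, sql = policy(question, history)   # Think
        ok, observation = execute(conn, sql)       # Act
        history.append(                            # Observe
            {"thought": thought, "sql": sql, "ok": ok, "observation": observation}
        )
        if ok:  # simplification: stop at the first query that executes
            break
    return history


if __name__ == "__main__":
    question = "Show me sales figures for Q3 by region"
    for turn in think_act_observe(make_db(), question, stub_policy):
        print(turn["thought"], "|", turn["sql"], "|", turn["observation"])
```

In this toy run the first query fails on the unknown column, and the second turn uses that error message to produce a query that executes. That is the self-correction behaviour the abstract attributes to execution feedback.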

Authors (4)

Haolin Yang

Jipeng Zhang

Zhitao He

Yi R. Fung

Submitted

November 2, 2025

arXiv Category

cs.CL


Key Contributions

MARS-SQL is a novel multi-agent RL framework for Text-to-SQL that combines task decomposition and interactive learning. It uses specialized agents (Grounding, Generation, Validation) and a ReAct-style loop for the Generation agent to enable dynamic reasoning and self-correction via database interaction.
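The Validation Agent's selection step can be sketched in the same spirit. The abstract says only that verification is modeled as next-token prediction and that the highest-probability solution is chosen. The sketch below assumes one possible reading: a verifier model emits logits for a "yes" and a "no" token for each candidate, and the candidate with the highest "yes" probability wins. The field names `yes_logit` and `no_logit` and the logit values are hypothetical.

```python
import math


def yes_probability(yes_logit, no_logit):
    """Two-way softmax over the verifier's (assumed) yes/no token logits."""
    m = max(yes_logit, no_logit)  # subtract the max for numerical stability
    e_yes = math.exp(yes_logit - m)
    e_no = math.exp(no_logit - m)
    return e_yes / (e_yes + e_no)


def select_trajectory(candidates):
    """Return the candidate whose 'yes' token is most probable."""
    if not candidates:
        raise ValueError("need at least one candidate trajectory")
    return max(
        candidates,
        key=lambda c: yes_probability(c["yes_logit"], c["no_logit"]),
    )


if __name__ == "__main__":
    # Invented logits standing in for real verifier outputs.
    candidates = [
        {"sql": "SELECT 1", "yes_logit": 0.2, "no_logit": 1.0},
        {"sql": "SELECT 2", "yes_logit": 2.5, "no_logit": -0.5},
        {"sql": "SELECT 3", "yes_logit": 1.0, "no_logit": 1.0},
    ]
    print(select_trajectory(candidates)["sql"])
```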

Business Value

Empowers non-technical users to query complex databases using natural language, significantly improving data accessibility and accelerating business intelligence processes. This can lead to more data-driven decision-making across organizations.

Paper Metadata

Innovation Type

Framework

Deployment Feasibility

Moderate to High. Requires integration with database systems and careful agent training.

Limitations Addressed

Complex natural language queries are difficult to translate to SQL, often requiring interaction and self-correction. MARS-SQL addresses this by enabling agents to interact with a live database and learn from execution feedback.

Performance Gains

Reports state-of-the-art Execution Accuracy of 77.84% on the BIRD dev set and 89.75% on the Spider test set.
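Execution Accuracy, the metric reported in the abstract, scores a predicted query as correct when executing it returns the same result as the gold query. The following is a simplified sketch of that idea, not the official BIRD or Spider evaluation script: it compares results as multisets of rows, ignores row order, and counts any predicted query that raises an error as wrong.

```python
import sqlite3
from collections import Counter


def run(conn, sql):
    """Execute a query; return its rows as a multiset, or None on error."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn, pairs):
    """Fraction of (predicted_sql, gold_sql) pairs with matching results."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    correct = 0
    for predicted_sql, gold_sql in pairs:
        predicted = run(conn, predicted_sql)
        gold = run(conn, gold_sql)
        # A failed predicted query (None) never counts as a match.
        if predicted is not None and predicted == gold:
            correct += 1
    return correct / len(pairs)
```

Comparing multisets rather than ordered lists is a design choice for this sketch. A stricter variant would compare ordered rows whenever the gold query contains an `ORDER BY` clause.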

Technical Tags

Text-to-SQL, Multi-agent Reinforcement Learning, LLM, Task Decomposition, Interactive Learning, Self-correction, Schema Linking, Query Generation, Database Interaction

Research Topics

Natural Language Interfaces, Database Querying, Reinforcement Learning, Multi-agent Systems, LLM Reasoning

Methods & Architectures

Multi-agent Reinforcement Learning (MARL), Task Decomposition, Interactive Reinforcement Learning, ReAct (Reasoning and Acting) loop, Think-Act-Observe, Schema Linking, Query Generation, Validation Agent, Specialized Agents (Grounding, Generation, Validation), LLM (for Generation Agent)

Applications & Tasks

Database Management, Business Intelligence, Data Analysis; Complex natural language to SQL translation, Need for environmental interaction and self-correction, Handling ambiguity in queries; Translating natural language to SQL, Generating complex SQL queries, Self-correcting query generation through database interaction

Related Fields

Natural Language Processing, Database Systems, Reinforcement Learning, Multi-agent Systems, Artificial Intelligence

Keywords

Text-to-SQL, MARL, Multi-agent, Reinforcement Learning, LLM, SQL Generation, Database, Natural Language Interface, Self-correction, ReAct, Task Decomposition

Academic Context

#Natural Language Interfaces, #Database Querying, #Reinforcement Learning, #Multi-agent Systems, #LLM Reasoning

Technology Stack

Frameworks & Libraries

LLM

Data Processing Tools

Databases

Commercial Potential

Potential Products

Natural language query interface for databases, Automated SQL generation tool

Target Industries

Finance, Healthcare, E-commerce, Business Intelligence, Technology

Use Case Examples

Allowing a marketing manager to ask 'Show me sales figures for Q3 by region' and get the correct SQL query executed.

Enabling a researcher to retrieve specific data points from a large scientific database using conversational queries.

Automating the creation of complex reports by translating business questions into SQL.

Competitive Edge

Outperforms existing Text-to-SQL methods by incorporating multi-agent RL and interactive self-correction, particularly for complex queries.

Resource Requirements

Compute Needs

Significant compute for training multi-agent RL policies and interacting with databases.

Data Requirements

Requires datasets of natural language questions paired with corresponding SQL queries and database schemas.

Deployment Constraints

Requires integration with existing database infrastructure and careful management of agent interactions.

Scalability

Scalability depends on the efficiency of the RL training and the ability to handle large databases.
