Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: We introduce Falcon, a cross-domain Chinese text-to-SQL benchmark grounded in
an enterprise-compatible dialect (MaxCompute/Hive). It contains 600 Chinese
questions over 28 databases; 77% require multi-table reasoning and over half
touch more than four tables. Each example is annotated along SQL-computation
features and Chinese semantics. For evaluation, we release a robust execution
comparator and an automated evaluation pipeline, under which all current
state-of-the-art large-scale models (including Deepseek) achieve accuracies of
at most 50%. Major errors originate from two sources: (1) schema linking in
large enterprise landscapes - hundreds of tables, denormalized fields,
ambiguous column names, implicit foreign-key relations and domain-specific
synonyms that make correct join/column selection difficult; and (2) mapping
concise, colloquial Chinese into the exact operators and predicates required
for analytics - e.g., choosing the correct aggregation and group-by keys,
expressing time windows and granularities, applying unit conversions, handling
NULLs and data-quality rules, and formulating nested or windowed subqueries.
Falcon therefore targets Chinese-specific semantics and enterprise dialects
(abbreviations, business jargon, fuzzy entity references) and provides a
reproducible middle ground before full production deployment by using realistic
enterprise schemas, query templates, an execution comparator, and an automated
evaluation pipeline for end-to-end validation.
Authors (9)
Wenzhen Luo
Wei Guan
Yifan Yao
Yimin Pan
Feng Wang
Zhipeng Yu
+3 more
Submitted
October 23, 2025
Key Contributions
This paper introduces Falcon, a comprehensive Chinese text-to-SQL benchmark specifically designed for enterprise environments, featuring complex queries requiring multi-table reasoning. It also provides a robust evaluation framework and highlights the significant performance gap of current state-of-the-art models on such challenging enterprise data.
Business Value
Enables more accurate and efficient data access for Chinese-speaking enterprise users, potentially democratizing data analysis and reducing reliance on specialized SQL developers.