Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Training large language model agents on tasks at the frontier of their
capabilities is key to unlocking advanced reasoning. We introduce a data
synthesis approach inspired by the educational theory of the Zone of Proximal
Development (ZPD), which defines this frontier as tasks an LLM cannot solve
alone but can master with guidance. To operationalize this, we present the
AgentFrontier Engine, an automated pipeline that synthesizes high-quality,
multidisciplinary data situated precisely within the LLM's ZPD. This engine
supports both continued pre-training with knowledge-intensive data and targeted
post-training on complex reasoning tasks. From the same framework, we derive
the ZPD Exam, a dynamic and automated benchmark designed to evaluate agent
capabilities on these frontier tasks. We train AgentFrontier-30B-A3B model on
our synthesized data, which achieves state-of-the-art results on demanding
benchmarks like Humanity's Last Exam, even surpassing some leading proprietary
agents. Our work demonstrates that a ZPD-guided approach to data synthesis
offers a scalable and effective path toward building more capable LLM agents.
Authors (10)
Xuanzhong Chen
Zile Qiao
Guoxin Chen
Liangcai Su
Zhen Zhang
Xinyu Wang
+4 more
Submitted
October 28, 2025
Key Contributions
AgentFrontier introduces a data synthesis approach inspired by the Zone of Proximal Development (ZPD) to train LLM agents on tasks at the frontier of their capabilities. The AgentFrontier Engine automatically synthesizes high-quality, multidisciplinary data situated within the LLM's ZPD for both continued pre-training and targeted post-training. It also introduces the ZPD Exam, a dynamic benchmark for evaluating agent capabilities on these frontier tasks.
Business Value
By enabling LLM agents to tackle more complex reasoning tasks, this research can lead to more capable AI assistants, advanced automation tools, and breakthroughs in fields requiring sophisticated problem-solving, driving innovation across industries.