📄 Abstract
Abstract: Large language models (LLMs) are catalyzing the development of autonomous AI
research agents for scientific and engineering discovery. We present FM Agent,
a novel and general-purpose multi-agent framework that leverages a synergistic
combination of LLM-based reasoning and large-scale evolutionary search to
address complex real-world challenges. The core of FM Agent integrates several
key innovations: 1) a cold-start initialization phase incorporating expert
guidance, 2) a novel evolutionary sampling strategy for iterative optimization,
3) domain-specific evaluators that combine correctness, effectiveness, and
LLM-supervised feedback, and 4) a distributed, asynchronous execution
infrastructure built on Ray. Demonstrating broad applicability, our system has
been evaluated across diverse domains, including operations research, machine
learning, GPU kernel optimization, and classical mathematical problems. FM
Agent reaches state-of-the-art results autonomously, without human
interpretation or tuning -- 1976.3 on ALE-Bench (+5.2\%), 43.56\% on MLE-Bench
(+4.0pp), up to 20x speedups on KernelBench, and establishes new
state-of-the-art(SOTA) results on several classical mathematical problems.
Beyond academic benchmarks, FM Agent shows considerable promise for both
large-scale enterprise R\&D workflows and fundamental scientific research,
where it can accelerate innovation, automate complex discovery processes, and
deliver substantial engineering and scientific advances with broader societal
impact.
Authors (22)
Annan Li
Chufan Wu
Zengle Ge
Yee Hin Chong
Zhinan Hou
Lizhe Cao
+16 more
Submitted
October 30, 2025
Key Contributions
Presents FM Agent, a novel multi-agent framework combining LLM reasoning with large-scale evolutionary search for complex real-world challenges. Key innovations include expert-guided cold-start, evolutionary sampling, domain-specific evaluators, and a distributed infrastructure on Ray, achieving state-of-the-art results autonomously.
Business Value
Accelerates scientific and engineering discovery, potentially leading to breakthroughs and optimized solutions in various industries.