Research Paper. Target audience: Formal Mathematicians, Researchers in Automated Theorem Proving, AI Researchers working on LLMs for specialized domains

ProofOptimizer: Training Language Models to Simplify Proofs without Human Demonstrations

Abstract

Neural theorem proving has advanced rapidly in the past year, reaching IMO gold-medalist capabilities and producing formal proofs that span thousands of lines. Although such proofs are mechanically verified by formal systems like Lean, their excessive length renders them difficult for humans to comprehend and limits their usefulness for mathematical insight. Proof simplification is therefore a critical bottleneck. Yet, training data for this task is scarce, and existing methods, mainly agentic scaffolding with off-the-shelf LLMs, struggle with the extremely long proofs generated by RL-trained provers. We introduce ProofOptimizer, the first language model trained to simplify Lean proofs without requiring additional human supervision. ProofOptimizer is trained via expert iteration and reinforcement learning, using Lean to verify simplifications and provide training signal. At inference time, it operates within an iterative proof-shortening workflow, progressively reducing proof length. Experiments show that ProofOptimizer substantially compresses proofs generated by state-of-the-art RL-trained provers on standard benchmarks, reducing proof length by 87% on miniF2F, 57% on PutnamBench, and 49% on Seed-Prover's IMO 2025 proofs. Beyond conciseness, the simplified proofs check faster in Lean and further improve downstream prover performance when reused as training data for supervised finetuning.
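
To make "proof simplification" concrete, here is a toy Lean 4 illustration. It is not taken from the paper and is far smaller than the proofs the abstract describes; it only shows two proofs of the same statement, both accepted by Lean, where the second is shorter.

```lean
-- Verbose tactic proof: split the conjunction and close each goal separately.
example (p q : Prop) (hp : p) (hq : q) : p ∧ q := by
  constructor
  · exact hp
  · exact hq

-- Simplified proof of the same statement: one anonymous-constructor term.
example (p q : Prop) (hp : p) (hq : q) : p ∧ q :=
  ⟨hp, hq⟩
```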

Authors (5)

Alex Gu

Bartosz Piotrowski

Fabian Gloeckle

Kaiyu Yang

Aram H. Markosyan

Submitted

October 17, 2025

arXiv Category

cs.LG

Key Contributions

ProofOptimizer is the first language model trained to simplify Lean proofs without human demonstrations, addressing the scarcity of training data for this task. It uses expert iteration and reinforcement learning with Lean's verification for training signal, and operates iteratively at inference to progressively shorten proofs, making them more comprehensible.
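
The iterative shortening workflow described above might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `propose` stands in for the language model, `verify` stands in for a call to the Lean checker, and the token-count length metric is a placeholder. All names here are hypothetical.

```python
from typing import Callable, List


def proof_length(proof: str) -> int:
    """Placeholder length metric (assumption): whitespace-separated tokens."""
    return len(proof.split())


def iterative_shorten(
    proof: str,
    propose: Callable[[str], List[str]],  # hypothetical stand-in for the model
    verify: Callable[[str], bool],        # hypothetical stand-in for Lean
    max_rounds: int = 8,
) -> str:
    """Repeatedly replace the proof with its shortest verified candidate."""
    best = proof
    for _ in range(max_rounds):
        shorter = [
            c for c in propose(best)
            if verify(c) and proof_length(c) < proof_length(best)
        ]
        if not shorter:
            break  # no verified improvement this round
        best = min(shorter, key=proof_length)
    return best


# Toy stand-ins so the sketch runs without a model or a Lean installation.
def toy_propose(proof: str) -> List[str]:
    """Propose every variant obtained by deleting exactly one line."""
    lines = proof.splitlines()
    return ["\n".join(lines[:i] + lines[i + 1:]) for i in range(len(lines))]


def toy_verify(proof: str) -> bool:
    """Pretend a proof checks if and only if it still contains `exact h`."""
    return "exact h" in proof.splitlines()


shortened = iterative_shorten("skip\nskip\nexact h", toy_propose, toy_verify)
print(shortened)
```

Because a candidate is accepted only when the verifier passes and the length strictly decreases, the loop can never return an unverified or longer proof than its input.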

Business Value

Enables more efficient and accessible formal verification of mathematical theorems, potentially accelerating research and development in formal methods and mathematics by making complex proofs easier to understand and verify.

Paper Metadata

Innovation Type

Novel Method/Algorithm

Deployment Feasibility

Moderate. Requires integration with the Lean theorem prover and computational resources for training and inference. Because inference runs several shortening passes, latency may limit real-time use.

Limitations Addressed

Scarcity of training data for proof simplification, Difficulty in comprehending long formal proofs, Limitations of existing LLM-based methods for long proofs

Technical Tags

language models, proof simplification, reinforcement learning, expert iteration, formal verification, Lean theorem prover, natural language processing, mathematical reasoning, iterative refinement, supervised learning

Research Topics

Automated Theorem Proving, Mathematical Language Models, Proof Assistance, Machine Learning for Mathematics, Formal Verification

Methods & Architectures

Expert Iteration, Reinforcement Learning, Iterative Proof Shortening, Formal Verification (Lean), Language Model

Applications & Tasks

Formal Mathematics, Automated Theorem Proving, Mathematical Research, Proof Complexity, Human Comprehension of Formal Proofs, Data Scarcity for Proof Simplification, Proof Simplification, Generating Human-Readable Formal Proofs

Related Fields

Formal Methods, Automated Reasoning, Natural Language Processing, Machine Learning, Computer Science Theory

Keywords

Proof simplification, Language models, Formal proofs, Lean, Reinforcement learning, Expert iteration, Mathematical insight, Automated theorem proving, Human comprehension, Iterative refinement

Technology Stack

Frameworks & Libraries

Lean

Commercial Potential

Potential Products

Automated proof assistant tool, Mathematical insight generation tool

Target Industries

Academia (Mathematics, Computer Science), Software Verification

Use Case Examples

Simplifying complex mathematical proofs generated by AI, Making formal proofs more accessible to human mathematicians

Competitive Edge

Offers a novel approach to proof simplification that does not rely on human demonstrations, differentiating it from methods that might require extensive human annotation or supervision.

Market Opportunity

Niche but growing interest in formal methods and AI for scientific discovery.

Revenue Models

Licensing to research institutions, potential integration into theorem prover software.

Resource Requirements

Compute Needs

Significant for training; moderate for inference depending on proof length and iterations.

Data Requirements

Large corpus of formal proofs (Lean) and their simplified versions (implicitly generated via expert iteration/RL).
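
One way such (original, simplified) pairs could be generated without human demonstrations is sketched below. It assumes a generic expert-iteration style filter: sample candidates, keep only those that verify and are shorter, and store the best as a training pair. The exact procedure used by ProofOptimizer may differ; `sample`, `verify`, and the character-count length metric are hypothetical stand-ins.

```python
from typing import Callable, List, Tuple


def collect_pairs(
    proofs: List[str],
    sample: Callable[[str], List[str]],  # hypothetical stand-in for the model
    verify: Callable[[str], bool],       # hypothetical stand-in for Lean
) -> List[Tuple[str, str]]:
    """Build (original, simplified) pairs from verified, shorter candidates."""
    pairs = []
    for original in proofs:
        valid = [
            c for c in sample(original)
            if verify(c) and len(c) < len(original)  # length = characters (assumption)
        ]
        if valid:
            pairs.append((original, min(valid, key=len)))
    return pairs


# Toy stand-ins so the sketch runs on its own.
def toy_sample(proof: str) -> List[str]:
    """Return one candidate with the first `skip` line removed, plus a bad one."""
    return [proof.replace("skip\n", "", 1), "sorry"]


def toy_verify(proof: str) -> bool:
    """Pretend a proof checks if and only if it ends with `exact h`."""
    return proof.endswith("exact h")


pairs = collect_pairs(["skip\nexact h", "exact h"], toy_sample, toy_verify)
print(pairs)
```

In a full expert-iteration loop, the collected pairs would be used to finetune the model, and the improved model would then serve as `sample` in the next round.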

Deployment Constraints

Dependency on Lean theorem prover, Computational cost of iterative simplification

Scalability

Cost grows with the complexity and length of the proofs being simplified. Training requires substantial compute.

Production Readiness

Maturity Level

Research/Development

Time to Market

1-3 years

Patent Potential

Low
