Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Multi-modal Large Language Models (MLLMs) have gained significant attention
in both academia and industry for their capabilities in handling multi-modal
tasks. However, these models face challenges in mathematical geometric
reasoning due to the scarcity of high-quality geometric data. To address this
issue, synthetic geometric data has become an essential strategy. Current
methods for generating synthetic geometric data involve rephrasing or expanding
existing problems and utilizing predefined rules and templates to create
geometric images and problems. However, these approaches often produce data
that lacks diversity or is prone to noise. Additionally, the geometric images
synthesized by existing methods tend to exhibit limited variation and deviate
significantly from authentic geometric diagrams. To overcome these limitations,
we propose GeoFM, a novel method for synthesizing geometric data. GeoFM uses
formal languages to explore combinations of conditions within metric space,
generating high-fidelity geometric problems that differ from the originals
while ensuring correctness through a symbolic engine. Experimental results show
that our synthetic data significantly outperforms existing methods. The model
trained with our data surpass the proprietary GPT-4o model by 18.7\% on
geometry problem-solving tasks in MathVista and by 16.5\% on GeoQA.
Additionally, it exceeds the performance of a leading open-source model by
5.7\% on MathVista and by 2.7\% on GeoQA.
Authors (5)
Yuhao Zhang
Dingxin Hu
Tinghao Yu
Hao Liu
Yiting Liu
Submitted
October 31, 2025
Key Contributions
Proposes GeoFM, a novel method for synthesizing geometric data for MLLMs using formal languages to explore conditions within metric spaces. This approach generates diverse, high-quality geometric images and problems, overcoming limitations of existing methods that produce noisy or non-diverse data and deviate from authentic diagrams.
Business Value
Enables the development of more capable MLLMs for tasks involving geometric understanding and reasoning, with applications in education, design, and scientific visualization.