📄 Abstract
Diagrams play a central role in research papers for conveying ideas, yet they
are often notoriously complex and labor-intensive to create. Although diagrams
are presented as images, standard image generative models struggle to produce
clear diagrams with well-defined structure. We argue that a promising direction
is to generate demonstration diagrams directly in textual form as SVGs, which
can leverage recent advances in large language models (LLMs). However, due to
the complexity of components and the multimodal nature of diagrams,
sufficiently discriminative and explainable metrics for evaluating the quality
of LLM-generated diagrams remain lacking. In this paper, we propose
DiagramEval, a novel evaluation metric designed to assess demonstration
diagrams generated by LLMs. Specifically, DiagramEval conceptualizes diagrams
as graphs, treating text elements as nodes and their connections as directed
edges, and evaluates diagram quality using two new groups of metrics: node
alignment and path alignment. For the first time, we effectively evaluate
diagrams produced by state-of-the-art LLMs on recent research literature,
quantitatively demonstrating the validity of our metrics. Furthermore, we show
how the enhanced explainability of our proposed metrics offers valuable
insights into the characteristics of LLM-generated diagrams. Code:
https://github.com/ulab-uiuc/diagram-eval.
Authors (2)
Chumeng Liang
Jiaxuan You
Submitted
October 29, 2025
Key Contributions
This paper introduces DiagramEval, an evaluation metric for LLM-generated diagrams that addresses the lack of sufficiently discriminative and explainable metrics. DiagramEval models a diagram as a graph, with text elements as nodes and their connections as directed edges, and scores diagram quality with two groups of metrics: node alignment and path alignment. It targets demonstration diagrams generated in textual form as SVGs.
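To make the graph view concrete, here is a minimal sketch of how node alignment and path alignment scores might be computed. It is an illustration under stated assumptions, not the paper's actual definitions: it assumes nodes match by exact text after lowercasing and whitespace normalization, and that path alignment compares the sets of ordered node pairs connected by a directed path. The function names (`node_alignment`, `path_alignment`) and the example diagram are hypothetical.

```python
from collections import defaultdict


def normalize(label):
    """Lowercase and collapse whitespace so trivially different labels match."""
    return " ".join(label.lower().split())


def prf(predicted, reference):
    """Precision, recall, and F1 between two sets."""
    overlap = len(predicted & reference)
    precision = overlap / len(predicted) if predicted else 0.0
    recall = overlap / len(reference) if reference else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def reachable_pairs(edges):
    """All ordered pairs (u, v) such that a directed path leads from u to v."""
    adjacency = defaultdict(set)
    for source, target in edges:
        adjacency[normalize(source)].add(normalize(target))
    pairs = set()
    for start in list(adjacency):
        stack = list(adjacency[start])
        seen = set()
        while stack:  # iterative depth-first search; `seen` guards against cycles
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            pairs.add((start, node))
            stack.extend(adjacency.get(node, ()))
    return pairs


def node_alignment(generated_nodes, reference_nodes):
    """Assumed definition: set overlap of normalized text labels."""
    generated = {normalize(n) for n in generated_nodes}
    reference = {normalize(n) for n in reference_nodes}
    return prf(generated, reference)


def path_alignment(generated_edges, reference_edges):
    """Assumed definition: set overlap of reachable (source, target) pairs."""
    return prf(reachable_pairs(generated_edges), reachable_pairs(reference_edges))


# Hypothetical reference diagram and an LLM-generated attempt.
reference_nodes = {"Input Image", "Encoder", "Decoder", "Output"}
reference_edges = [("Input Image", "Encoder"), ("Encoder", "Decoder"), ("Decoder", "Output")]
generated_nodes = {"input image", "Encoder", "Output", "Loss"}
generated_edges = [("input image", "Encoder"), ("Encoder", "Output")]

node_scores = node_alignment(generated_nodes, reference_nodes)
path_scores = path_alignment(generated_edges, reference_edges)
print("node alignment (P, R, F1):", node_scores)
print("path alignment (P, R, F1):", path_scores)
```

In this toy case the generated diagram drops the "Decoder" node and adds a spurious "Loss" node, so node precision and recall both fall to 0.75. Every path it draws also exists in the reference (path precision 1.0), but it recovers only half of the reference paths (path recall 0.5). Separating the scores this way is one way a metric can point at what is missing rather than only reporting a single number.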
Business Value
Provides a way to measure the quality of AI-generated diagrams, which can help make them more reliable for technical documentation, research communication, and educational materials, and reduce the manual effort of creating them.