Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Medical Visual Question Answering (Med-VQA) holds significant potential for
clinical decision support, yet existing efforts primarily focus on 2D imaging
with limited task diversity. This paper presents 3D-RAD, a large-scale dataset
designed to advance 3D Med-VQA using radiology CT scans. The 3D-RAD dataset
encompasses six diverse VQA tasks: anomaly detection, image observation,
medical computation, existence detection, static temporal diagnosis, and
longitudinal temporal diagnosis. It supports both open- and closed-ended
questions while introducing complex reasoning challenges, including
computational tasks and multi-stage temporal analysis, to enable comprehensive
benchmarking. Extensive evaluations demonstrate that existing vision-language
models (VLMs), especially medical VLMs exhibit limited generalization,
particularly in multi-temporal tasks, underscoring the challenges of real-world
3D diagnostic reasoning. To drive future advancements, we release a
high-quality training set 3D-RAD-T of 136,195 expert-aligned samples, showing
that fine-tuning on this dataset could significantly enhance model performance.
Our dataset and code, aiming to catalyze multimodal medical AI research and
establish a robust foundation for 3D medical visual understanding, are publicly
available at https://github.com/Tang-xiaoxiao/3D-RAD.
Authors (6)
Xiaotang Gai
Jiaxiang Liu
Yichen Li
Zijie Meng
Jian Wu
Zuozhu Liu
Key Contributions
This paper introduces 3D-RAD, a large-scale dataset for 3D Medical Visual Question Answering (Med-VQA) using CT scans, encompassing six diverse VQA tasks including multi-temporal analysis. It highlights the limited generalization of current VLMs, especially medical VLMs, in complex 3D diagnostic reasoning, particularly for multi-temporal tasks, underscoring the need for better models.
Business Value
Accelerates the development of AI-powered clinical decision support systems for radiologists, leading to improved diagnostic accuracy, efficiency, and patient outcomes in medical imaging.