Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 90% Match Research Paper Oncologists,Radiologists,Medical AI Researchers,Healthcare Providers 2 weeks ago

A Synthetic Data-Driven Radiology Foundation Model for Pan-tumor Clinical Diagnosis

large-language-models › multimodal-llms
📄 Abstract

Abstract: AI-assisted imaging made substantial advances in tumor diagnosis and management. However, a major barrier to developing robust oncology foundation models is the scarcity of large-scale, high-quality annotated datasets, which are limited by privacy restrictions and the high cost of manual labeling. To address this gap, we present PASTA, a pan-tumor radiology foundation model built on PASTA-Gen, a synthetic data framework that generated 30,000 3D CT scans with pixel-level lesion masks and structured reports of tumors across ten organ systems. Leveraging this resource, PASTA achieves state-of-the-art performance on 45 of 46 oncology tasks, including non-contrast CT tumor screening, lesion segmentation, structured reporting, tumor staging, survival prediction, and MRI-modality transfer. To assess clinical applicability, we developed PASTA-AID, a clinical decision support system, and ran a retrospective simulated clinical trial across two scenarios. For pan-tumor screening on plain CT with fixed reading time, PASTA-AID increased radiologists' throughput by 11.1-25.1% and improved sensitivity by 17.0-31.4% and precision by 10.5-24.9%; additionally, in a diagnosis-aid workflow, it reduced segmentation time by up to 78.2% and reporting time by up to 36.5%. Beyond gains in accuracy and efficiency, PASTA-AID narrowed the expertise gap, enabling less-experienced radiologists to approach expert-level performance. Together, this work establishes an end-to-end, synthetic data-driven pipeline spanning data generation, model development, and clinical validation, thereby demonstrating substantial potential for pan-tumor research and clinical translation.
Authors (16)
Wenhui Lei
Hanyu Chen
Zitian Zhang
Luyang Luo
Qiong Xiao
Yannian Gu
+10 more
Submitted
February 10, 2025
arXiv Category
eess.IV
arXiv PDF

Key Contributions

Introduces PASTA, a pan-tumor radiology foundation model built on PASTA-Gen, a synthetic data framework generating 30,000 3D CT scans with masks and reports. This addresses data scarcity and privacy issues, enabling state-of-the-art performance across 45 oncology tasks and a clinical decision support system (PASTA-AID).

Business Value

Accelerates AI development in oncology by overcoming data limitations, leading to more accurate and comprehensive diagnostic tools for cancer detection, staging, and prognosis.