📄 Abstract
Abstract: AI-assisted imaging made substantial advances in tumor diagnosis and
management. However, a major barrier to developing robust oncology foundation
models is the scarcity of large-scale, high-quality annotated datasets, which
are limited by privacy restrictions and the high cost of manual labeling. To
address this gap, we present PASTA, a pan-tumor radiology foundation model
built on PASTA-Gen, a synthetic data framework that generated 30,000 3D CT
scans with pixel-level lesion masks and structured reports of tumors across ten
organ systems. Leveraging this resource, PASTA achieves state-of-the-art
performance on 45 of 46 oncology tasks, including non-contrast CT tumor
screening, lesion segmentation, structured reporting, tumor staging, survival
prediction, and MRI-modality transfer. To assess clinical applicability, we
developed PASTA-AID, a clinical decision support system, and ran a
retrospective simulated clinical trial across two scenarios. For pan-tumor
screening on plain CT with fixed reading time, PASTA-AID increased
radiologists' throughput by 11.1-25.1% and improved sensitivity by 17.0-31.4%
and precision by 10.5-24.9%; additionally, in a diagnosis-aid workflow, it
reduced segmentation time by up to 78.2% and reporting time by up to 36.5%.
Beyond gains in accuracy and efficiency, PASTA-AID narrowed the expertise gap,
enabling less-experienced radiologists to approach expert-level performance.
Together, this work establishes an end-to-end, synthetic data-driven pipeline
spanning data generation, model development, and clinical validation, thereby
demonstrating substantial potential for pan-tumor research and clinical
translation.
Authors (16)
Wenhui Lei
Hanyu Chen
Zitian Zhang
Luyang Luo
Qiong Xiao
Yannian Gu
+10 more
Submitted
February 10, 2025
Key Contributions
Introduces PASTA, a pan-tumor radiology foundation model built on PASTA-Gen, a synthetic data framework generating 30,000 3D CT scans with masks and reports. This addresses data scarcity and privacy issues, enabling state-of-the-art performance across 45 oncology tasks and a clinical decision support system (PASTA-AID).
Business Value
Accelerates AI development in oncology by overcoming data limitations, leading to more accurate and comprehensive diagnostic tools for cancer detection, staging, and prognosis.