arxiv_ai · Research Paper · Medical Imaging Researchers, Radiologists, AI Developers in Healthcare

AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models


Abstract

Medical image segmentation is vital for clinical diagnosis, yet current deep learning methods often demand extensive expert effort, i.e., either through annotating large training datasets or providing prompts at inference time for each new case. This paper introduces a zero-shot and automatic segmentation pipeline that combines off-the-shelf vision-language and segmentation foundation models. Given a medical image and a task definition (e.g., "segment the optic disc in an eye fundus image"), our method uses a grounding model to generate an initial bounding box, followed by a visual prompt boosting module that enhances the prompts, which are then processed by a promptable segmentation model to produce the final mask. To address the challenges of domain gap and result verification, we introduce a test-time adaptation framework featuring a set of learnable adaptors that align the medical inputs with foundation model representations. Its hyperparameters are optimized via Bayesian Optimization, guided by a proxy validation model without requiring ground-truth labels. Our pipeline offers an annotation-efficient and scalable solution for zero-shot medical image segmentation across diverse tasks. Our pipeline is evaluated on seven diverse medical imaging datasets and shows promising results. By proper decomposition and test-time adaptation, our fully automatic pipeline not only substantially surpasses the previously best-performing method, yielding a 69% relative improvement in accuracy (Dice Score from 42.53 to 71.81), but also performs competitively with weakly-prompted interactive foundation models.
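The label-free adaptation loop described in the abstract can be illustrated with a toy sketch. Everything in it is assumed for illustration and is not taken from the paper: the adaptor is a single gamma correction, `proxy_score` is a hypothetical stand-in for the proxy validation model, and plain random search replaces Bayesian Optimization so that the example has no dependencies.

```python
import random

def apply_adaptor(image, gamma):
    """Hypothetical adaptor: gamma-correct pixel intensities in [0, 1]."""
    return [[px ** gamma for px in row] for row in image]

def proxy_score(image):
    """Hypothetical label-free proxy: prefers a mean intensity near 0.5.

    A real proxy validation model would judge the predicted mask instead;
    this toy version only shows that no ground-truth label is needed.
    """
    pixels = [px for row in image for px in row]
    mean = sum(pixels) / len(pixels)
    return -abs(mean - 0.5)

def search_adaptor(image, n_trials=50, seed=0):
    """Random search over gamma (a stand-in for Bayesian Optimization)."""
    rng = random.Random(seed)
    # Start from the identity adaptor (gamma = 1.0) as the baseline.
    best_gamma = 1.0
    best_score = proxy_score(apply_adaptor(image, best_gamma))
    for _ in range(n_trials):
        gamma = rng.uniform(0.2, 3.0)
        score = proxy_score(apply_adaptor(image, gamma))
        if score > best_score:
            best_gamma, best_score = gamma, score
    return best_gamma, best_score

dark_image = [[0.1, 0.2], [0.15, 0.25]]
baseline = proxy_score(dark_image)
gamma, score = search_adaptor(dark_image)
print(f"baseline proxy score: {baseline:.3f}, tuned: {score:.3f} (gamma={gamma:.2f})")
```

The point of the sketch is the control flow: the search only ever consults the proxy score, so the adaptor settings can be tuned per task without annotated masks.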

Key Contributions

Introduces AutoMiSeg, an automatic zero-shot medical image segmentation pipeline using foundation models. It combines a grounding model for bounding boxes, a prompt boosting module, and a promptable segmentation model, enhanced by test-time adaptation with learnable adaptors to bridge the domain gap, eliminating the need for extensive expert annotation or per-case prompting.
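The three-stage decomposition can be sketched as a simple chain of callables. This is a minimal illustration rather than the authors' implementation: `run_pipeline`, `Prompts`, and the `toy_*` functions are hypothetical names, and the toy stages only mimic the data flow (box, then enriched prompts, then mask) that real foundation models would provide. In particular, adding the box centre as a point prompt is an assumed example of "boosting", not the paper's method.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]   # (x0, y0, x1, y1), inclusive pixel coordinates
Point = Tuple[int, int]           # (x, y)
Mask = List[List[int]]

@dataclass
class Prompts:
    box: Box
    points: List[Point]

def run_pipeline(image, task: str,
                 ground: Callable, boost: Callable, segment: Callable) -> Mask:
    """Chain the stages: grounding -> prompt boosting -> segmentation."""
    box = ground(image, task)
    prompts = boost(image, box)
    return segment(image, prompts)

# Toy stand-ins so the sketch runs; real stages would wrap foundation models.
def toy_ground(image, task: str) -> Box:
    h, w = len(image), len(image[0])
    return (1, 1, w - 2, h - 2)

def toy_boost(image, box: Box) -> Prompts:
    x0, y0, x1, y1 = box
    center = ((x0 + x1) // 2, (y0 + y1) // 2)
    return Prompts(box=box, points=[center])

def toy_segment(image, prompts: Prompts) -> Mask:
    x0, y0, x1, y1 = prompts.box
    return [[1 if x0 <= x <= x1 and y0 <= y <= y1 else 0
             for x in range(len(image[0]))]
            for y in range(len(image))]

image = [[0] * 4 for _ in range(4)]
mask = run_pipeline(image, "segment the optic disc in an eye fundus image",
                    toy_ground, toy_boost, toy_segment)
```

Passing the stages in as callables keeps each one replaceable, which mirrors the summary's emphasis on combining off-the-shelf models.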

Business Value

Significantly reduces the cost and time associated with medical image segmentation, accelerating clinical diagnosis and research by making advanced segmentation accessible without specialized expertise.

Paper Metadata

Innovation Type

Automated Pipeline and Adaptation Method

Deployment Feasibility

Moderate to High; requires integration of multiple foundation models and adaptation modules.

Limitations Addressed

The extensive expert effort required for annotating large medical imaging datasets or providing inference-time prompts for segmentation tasks.

Performance Gains

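Achieves automatic segmentation without manual annotation or per-case prompting, addressing domain gap challenges. The abstract reports a Dice score increase from 42.53 to 71.81 (a 69% relative improvement) over the previously best-performing method, with evaluation on seven medical imaging datasets.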
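For readers unfamiliar with the metric, the sketch below shows how a Dice score is computed for binary masks and checks the relative improvement implied by the Dice figures quoted in the abstract (42.53 to 71.81). The function names and the toy masks are illustrative assumptions. Returning 1.0 for two empty masks is one common convention, not something the source specifies.

```python
def dice_score(pred, target):
    """Dice = 2|A ∩ B| / (|A| + |B|) for binary masks given as nested lists."""
    pred_flat = [p for row in pred for p in row]
    target_flat = [t for row in target for t in row]
    intersection = sum(p * t for p, t in zip(pred_flat, target_flat))
    total = sum(pred_flat) + sum(target_flat)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total

def relative_improvement(baseline, new):
    """Fractional gain of `new` over `baseline`."""
    return (new - baseline) / baseline

# Toy masks: one overlapping pixel, 2 predicted and 1 target pixel in total.
pred = [[1, 1], [0, 0]]
target = [[1, 0], [0, 0]]
toy_dice = dice_score(pred, target)            # 2 * 1 / (2 + 1)

gain = relative_improvement(42.53, 71.81)      # (71.81 - 42.53) / 42.53
print(f"toy Dice: {toy_dice:.3f}, relative improvement: {gain:.1%}")
```

The computed gain is roughly 68.8%, which rounds to the 69% stated in the abstract.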

Technical Tags

medical image segmentation, zero-shot segmentation, foundation models, test-time adaptation, vision-language models, promptable segmentation, domain adaptation, bounding box generation

Research Topics

Medical Image Analysis, Zero-Shot Segmentation, Foundation Models in Healthcare, Test-Time Adaptation

Methods & Architectures

grounding model, visual prompt boosting, promptable segmentation model, test-time adaptation framework, learnable adaptors, Bayesian optimization, Vision-Language Models, Segmentation Foundation Models

Applications & Tasks

Medical Imaging, Clinical Diagnosis, Need for large annotated datasets, Inference-time prompt engineering, Domain gap in medical imaging, Automatic segmentation, Medical image segmentation, Zero-shot segmentation of anatomical structures

Related Fields

Medical Imaging, Computer Vision, Deep Learning, Foundation Models, AI in Healthcare

Keywords

medical image segmentation, zero-shot, foundation models, test-time adaptation, vision-language, promptable segmentation, domain adaptation, automatic segmentation, clinical diagnosis, bounding box, eye fundus image

Academic Context

#Medical Image Analysis, #Zero-Shot Segmentation, #Foundation Models in Healthcare, #Test-Time Adaptation

Commercial Potential

Potential Products

Automated medical image analysis software, AI-assisted diagnostic tools, Segmentation modules for PACS systems

Target Industries

Healthcare, Biotechnology, Medical Device Manufacturing

Use Case Examples

Segmenting tumors in MRI scans, Identifying anatomical structures in CT scans, Quantifying tissue volumes in microscopy images

Competitive Edge

Provides an automated, zero-shot segmentation solution leveraging foundation models, reducing reliance on manual annotation and domain-specific fine-tuning.

Market Opportunity

Large and growing market for AI in medical imaging.

Revenue Models

SaaS for AI analysis, licensing to medical device companies.

Resource Requirements

Compute Needs

High (requires running multiple large foundation models)

Data Requirements

Requires access to foundation models and potentially medical image datasets for validation.

Deployment Constraints

Integration complexity and computational cost can be barriers.

Scalability

Scalability depends on the efficiency of the foundation models and the adaptation framework.

Regulatory Considerations

High (HIPAA, GDPR, FDA approval for clinical use)

Production Readiness

Maturity Level

Research/Development

Time to Market

2-4 years (due to regulatory hurdles and validation)

Patent Potential

Moderate (novel pipeline and adaptation technique)
