Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper Medical Imaging Researchers,Federated Learning Practitioners,AI in Healthcare Developers 17 hours ago

Real World Federated Learning with a Knowledge Distilled Transformer for Cardiac CT Imaging

computer-vision › medical-imaging
📄 Abstract

Abstract: Federated learning is a renowned technique for utilizing decentralized data while preserving privacy. However, real-world applications often face challenges like partially labeled datasets, where only a few locations have certain expert annotations, leaving large portions of unlabeled data unused. Leveraging these could enhance transformer architectures ability in regimes with small and diversely annotated sets. We conduct the largest federated cardiac CT analysis to date (n=8,104) in a real-world setting across eight hospitals. Our two-step semi-supervised strategy distills knowledge from task-specific CNNs into a transformer. First, CNNs predict on unlabeled data per label type and then the transformer learns from these predictions with label-specific heads. This improves predictive accuracy and enables simultaneous learning of all partial labels across the federation, and outperforms UNet-based models in generalizability on downstream tasks. Code and model weights are made openly available for leveraging future cardiac CT analysis.

Key Contributions

This paper introduces a novel two-step semi-supervised strategy for federated learning in cardiac CT imaging, leveraging knowledge distillation from CNNs into a transformer architecture. This approach effectively utilizes partially labeled and unlabeled data across multiple hospitals, improving predictive accuracy and generalizability, which is crucial for real-world federated learning applications with data heterogeneity.

Business Value

Enables more effective and privacy-preserving analysis of medical imaging data across institutions, potentially leading to earlier disease detection and improved patient outcomes. It addresses the challenge of data silos in healthcare by allowing collaborative model training without sharing raw patient data.