Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: This paper proposes a high-precision semantic segmentation method based on an
improved TransUNet architecture to address the challenges of complex lesion
structures, blurred boundaries, and significant scale variations in skin lesion
images. The method integrates a transformer module into the traditional
encoder-decoder framework to model global semantic information, while retaining
a convolutional branch to preserve local texture and edge features. This
enhances the model's ability to perceive fine-grained structures. A
boundary-guided attention mechanism and multi-scale upsampling path are also
designed to improve lesion boundary localization and segmentation consistency.
To verify the effectiveness of the approach, a series of experiments were
conducted, including comparative studies, hyperparameter sensitivity analysis,
data augmentation effects, input resolution variation, and training data split
ratio tests. Experimental results show that the proposed model outperforms
existing representative methods in mIoU, mDice, and mAcc, demonstrating
stronger lesion recognition accuracy and robustness. In particular, the model
achieves better boundary reconstruction and structural recovery in complex
scenarios, making it well-suited for the key demands of automated segmentation
tasks in skin lesion analysis.