UCSwin-UNet model for medical image segmentation based on cardiac haemangioma

被引:0
|
作者
Shi, Jian-Ting [1 ]
Qu, Gui-Xu [1 ]
Li, Zhi-Jun [2 ]
机构
[1] Heilongjiang Univ Sci & Technol, Sch Comp & Informat Engn, Harbin, Peoples R China
[2] Wuzhou Univ, Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China
关键词
biomedical imaging; biomedical ultrasonics; blood vessels; convolutional neural nets; image segmentation; U-NET ARCHITECTURE; DIAGNOSIS;
D O I
10.1049/ipr2.13175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cardiac hemangioma is a rare benign tumour that presents diagnostic challenges due to its variable clinical symptoms, imaging features, and locations. This study proposes a novel segmentation method based on a Convolutional Neural Network (CNN) and Transformer integration, with Swin-UNet as the core model. We incorporated a U-shaped convolutional neural network block into the original jump connection of Swin-UNet. The Binary Cross Entropy Loss (BCE Loss) algorithm was added, and the learning rate decay algorithm was modified to select the appropriate one by comparing loss values. This paper utilizes the publicly available cardiac angioma dataset in AI Studio, consisting of 215 images for training and testing. To evaluate the effectiveness of the proposed model, this paper demonstrates its optimality through ablation experiments and comparisons with other mainstream models. The comparison experiments show that this model improves Dice by approximately 12%, HD95 by approximately 4.7 mm, Accuracy by approximately 6.1%, and F1 score by 0.11 compared to models such as UNet, UNet++, and Deeplabv3+, etc. For the recently proposed SOTO models, such as TransUNet, Swin-UNet, and MultiResUnet, the Dice score improved by about 1.2%, HD95 reduced by about 1mm, Accuracy improved by about 0.3%, and F1 score improved by 0.015. This study introduces the UCSwin-UNet model, which adopts a U-shaped convolutional framework in the original model. After introducing a new learning rate decay strategy and incorporating the BCE Loss into the loss function, a revaluation of the weight allocation has been undertaken for each component within the loss formula. This model enhances the extraction of local features while introducing non-linearity, and multiple experiments were conducted in the paper to validate its effectiveness, accompanied by a visualization demonstration. image
引用
收藏
页数:14
相关论文
共 50 条
  • [41] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [42] A Comprehensive Exploration of L-UNet Approach: Revolutionizing Medical Image Segmentation
    Alafer F.
    Siddiqi M.H.
    Khan M.S.
    Ahmad I.
    Alhujaili S.
    Alrowaili Z.
    Alshabibi A.S.
    IEEE Access, 2024, 12 : 1 - 1
  • [43] A novel full-convolution UNet-transformer for medical image segmentation
    Zhu, Tianyou
    Ding, Derui
    Wang, Feng
    Liang, Wei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [44] CellSegUNet: an improved deep segmentation model for the cell segmentation based on UNet++ and residual UNet models
    Sedat Metlek
    Neural Computing and Applications, 2024, 36 : 5799 - 5825
  • [45] ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation
    Li, Hao
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2083 - 2096
  • [46] N-Net: an UNet architecture with dual encoder for medical image segmentation
    Liang, Bingtao
    Tang, Chen
    Zhang, Wei
    Xu, Min
    Wu, Tianbo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3073 - 3081
  • [47] Pie-UNet: A Novel Parallel Interaction Encoder for Medical Image Segmentation
    Jiang, Youtao
    Zhang, Xiaoqian
    Chen, Yufeng
    Yang, Shukai
    Sun, Feng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 558 - 569
  • [48] N-Net: an UNet architecture with dual encoder for medical image segmentation
    Bingtao Liang
    Chen Tang
    Wei Zhang
    Min Xu
    Tianbo Wu
    Signal, Image and Video Processing, 2023, 17 : 3073 - 3081
  • [49] VIG-UNET: VISION GRAPH NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
    Jiang, Juntao
    Chen, Xiyu
    Tian, Guanzhong
    Liu, Yong
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [50] DSTUNET: UNET WITH EFFICIENT DENSE SWIN TRANSFORMER PATHWAY FOR MEDICAL IMAGE SEGMENTATION
    Cai, Zhuotong
    Xin, Jingmin
    Shi, Peiwen
    Wu, Jiayi
    Zheng, Nanning
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,