DMSA-UNet: Dual Multi-Scale Attention makes UNet more strong for medical image segmentation

Authors
Li, Xiang [1]
Fu, Chong [1, 6, 7]
Wang, Qun [2]
Zhang, Wenchao [3]
Sham, Chiu-Wing [4]
Chen, Junxin [5]
Affiliations
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[3] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710054, Peoples R China
[4] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[5] Dalian Univ Technol, Sch Software, Dalian 116621, Peoples R China
[6] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110819, Peoples R China
[7] Minist Educ, Engn Res Ctr Secur Technol Complex Network Syst, Shenyang, Peoples R China
Keywords
Medical image segmentation; UNet; Dual Multi-Scale Attention
DOI
10.1016/j.knosys.2024.112050
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional Neural Networks (CNNs), particularly UNet, have become prevalent in medical image segmentation tasks. However, CNNs inherently struggle to capture global dependencies owing to their intrinsic localities. Although Transformers have shown superior performance in modeling global dependencies, they encounter the challenges of high model complexity and dependencies on large-scale pre-trained models. Furthermore, the current attention mechanisms of Transformers only consider single-scale feature interactions, making it difficult to analyze feature correlations at different scales in the same attention layer. In this paper, we propose DMSA-UNet, which strengthens the global analysis capability and maximally preserves the local inductive bias capability while maintaining low model complexity. Specifically, we reformulate vanilla self-attention as efficient Dual Multi-Scale Attention (DMSA) that captures multi-scale-enhanced global information along both spatial and channel dimensions with linear complexity and pixel granularity. We also introduce a context-gated linear unit in DMSA for each feature to obtain adaptive attention based on neighboring contexts. To preserve the convolutional properties, DMSAs are inserted directly between the UNet's convolutional blocks rather than replacing them. Because DMSA has multi-scale adaptive aggregation capability, the deepest convolutional block of UNet is removed to mitigate the noise interference caused by fixed convolutional kernels with large receptive fields. We further leverage efficient convolution to reduce computational redundancy. DMSA-UNet is highly competitive in terms of model complexity, with 33% fewer parameters and 15% fewer FLOPs (at 224² resolution) than UNet. Extensive experimental results on four different medical datasets demonstrate that DMSA-UNet outperforms other state-of-the-art approaches without any pre-trained models.
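The abstract does not give implementation details, so the following is only a generic illustration of how attention computed across channels can scale linearly with the number of pixels. It is not the authors' DMSA. The function name `channel_attention`, the tensor shapes, and the L2 normalization are all assumptions chosen for the sketch.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def channel_attention(x, w_q, w_k, w_v):
    """Hypothetical self-attention across channels for x of shape (N, C).

    N = H * W is the number of pixels and C the number of channels.
    The attention map is C x C, so the cost is O(N * C^2): linear in N.
    Vanilla spatial self-attention would build an N x N map instead.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v  # each (N, C)
    # L2-normalize each channel over the pixel axis so the logits stay bounded.
    q = q / (np.linalg.norm(q, axis=0, keepdims=True) + 1e-6)
    k = k / (np.linalg.norm(k, axis=0, keepdims=True) + 1e-6)
    attn = softmax(q.T @ k, axis=-1)  # (C, C), each row sums to 1
    return v @ attn.T  # (N, C): each output channel mixes the value channels


# Usage with random data: a 56 x 56 feature map with 32 channels.
rng = np.random.default_rng(0)
n_pixels, channels = 56 * 56, 32
x = rng.standard_normal((n_pixels, channels))
w_q, w_k, w_v = (
    rng.standard_normal((channels, channels)) / np.sqrt(channels)
    for _ in range(3)
)
out = channel_attention(x, w_q, w_k, w_v)
print(out.shape)  # (3136, 32)
```

Doubling the image resolution quadruples `n_pixels` but leaves the attention map at 32 x 32, which is the sense in which this kind of attention is linear in the number of pixels.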
Pages: 10