MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion

Cited by: 7
Authors
Liang, Zhiwei [1]
Zhao, Kui [1]
Liang, Gang [1]
Li, Siyu [1]
Wu, Yifei [1]
Zhou, Yiping [1]
Affiliations
[1] Sichuan University, School of Cyber Science and Engineering, Chengdu, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Transformer; Medical image segmentation; Attention mechanism; Networks
DOI
10.1016/j.knosys.2023.110987
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs), especially U-shaped networks, have become the mainstream approach for medical image segmentation. However, due to the intrinsic locality of convolutional operations, CNNs have inherent limitations in capturing long-range dependencies. Although Transformer-based methods have demonstrated remarkable performance in computer vision by modeling long-range dependencies, their high computational complexity and reliance on large-scale pre-training present challenges, particularly for higher-resolution medical images. In this paper, we introduce MAXFormer, a U-shaped hierarchical network that effectively leverages global context within individual samples and relationships between different samples. Our Transformer module reformulates the self-attention mechanism into two parts: local-global attention and external attention. The local-global attention provides an efficient alternative to self-attention with linear complexity, employing a parallel architecture that allows local-global spatial interactions. The local attention branch captures high-frequency local information, while the global attention branch captures low-frequency global information. Furthermore, we have designed the Refined Fused Connection module to effectively merge feature outputs from each encoder block with the decoder output, mitigating the loss of spatial detail caused by downsampling. Extensive experiments on two different medical image segmentation datasets show that our proposed method outperforms other state-of-the-art methods without requiring pre-trained weights. Code will be available at https://github.com/zhiwei-liang/MAXFormer.
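The abstract names external attention as the part of the module that relates different samples, but it gives no formulas. As a rough illustration only, the sketch below assumes the commonly used formulation in which every token attends to a small set of learnable memory slots shared by all samples, so the cost grows linearly with the number of tokens N instead of quadratically. The function name, the memory matrices mem_k and mem_v, the double normalisation, and all sizes are assumptions made for this example; they are not taken from the MAXFormer code, and the local-global attention branch is not sketched here.

```python
import math
import random


def external_attention(tokens, mem_k, mem_v):
    """Hypothetical external-attention sketch (not the authors' implementation).

    tokens: N x d list of token features from one sample.
    mem_k:  S x d key memory, shared across all samples (assumed learnable).
    mem_v:  S x d_out value memory, shared across all samples.
    Returns an N x d_out list. Cost is O(N * S * d), linear in N.
    """
    n, s = len(tokens), len(mem_k)
    # Similarity of every token to every memory slot (N x S).
    scores = [[sum(t * k for t, k in zip(tok, slot)) for slot in mem_k]
              for tok in tokens]
    # Softmax over the token axis, computed separately for each memory slot.
    attn = [[0.0] * s for _ in range(n)]
    for j in range(s):
        col_max = max(scores[i][j] for i in range(n))
        exps = [math.exp(scores[i][j] - col_max) for i in range(n)]
        total = sum(exps)
        for i in range(n):
            attn[i][j] = exps[i] / total
    # L1-normalise each token's weights over the memory slots.
    for i in range(n):
        row_sum = sum(attn[i]) + 1e-9
        attn[i] = [a / row_sum for a in attn[i]]
    # Read the output from the value memory.
    d_out = len(mem_v[0])
    return [[sum(attn[i][j] * mem_v[j][c] for j in range(s))
             for c in range(d_out)]
            for i in range(n)]


# Toy usage with random values; sizes are arbitrary.
random.seed(0)
N, D, S = 6, 4, 3
tokens = [[random.gauss(0, 1) for _ in range(D)] for _ in range(N)]
mem_k = [[random.gauss(0, 1) for _ in range(D)] for _ in range(S)]
mem_v = [[random.gauss(0, 1) for _ in range(D)] for _ in range(S)]
out = external_attention(tokens, mem_k, mem_v)
print(len(out), len(out[0]))
```

Because the memories do not depend on the input sample, the same slots are reused for every image, which is one plausible reading of how such a layer could capture relationships between different samples.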
Pages: 10
Related papers
50 in total
  • [1] MAFUNet: Multi-Attention Fusion Network for Medical Image Segmentation
    Wang, Lili
    Zhao, Jiayu
    Yang, Hailu
    [J]. IEEE ACCESS, 2023, 11 : 109793 - 109802
  • [2] MM-UNet: Multi-attention mechanism and multi-scale feature fusion UNet for tumor image segmentation
    Xing, Yaozheng
    Yuan, Jie
    Liu, Qixun
    Peng, Shihao
    Yan, Yan
    Yao, Junyi
    [J]. 2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023 : 253 - 257
  • [3] A Multi-scale and Multi-attention Network for Skin Lesion Segmentation
    Wu, Cong
    Zhang, Hang
    Chen, Dingsheng
    Gan, Haitao
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 537 - 550
  • [4] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
    Rahman, Md Mostafijur
    Marculescu, Radu
    [J]. MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
  • [5] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
    Sun, Qian
    Zhao, Guangrui
    Xia, Xinyuan
    Xie, Yu
    Fang, Chenrong
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    [J]. REMOTE SENSING, 2024, 16 (12)
  • [6] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
    Xu, Zhenghua
    Tian, Biao
    Liu, Shijie
    Wang, Xiangtao
    Yuan, Di
    Gu, Junhua
    Chen, Junyang
    Lukasiewicz, Thomas
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02) : 1857 - 1871
  • [7] A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
    Cui, Jianguo
    Wang, Liejun
    Jiang, Shaochen
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (19)
  • [8] MSGAT: Multi-scale gated axial reverse attention transformer network for medical image segmentation
    Liu, Yanjun
    Yun, Haijiao
    Xia, Yang
    Luan, Jinyang
    Li, Mingjing
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
  • [9] Adaptive fusion with multi-scale features for interactive image segmentation
    Ding, Zongyuan
    Wang, Tao
    Sun, Quansen
    Wang, Hongyuan
    [J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5610 - 5621