MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion

被引：7

作者：

Liang, Zhiwei ^{[1
]}

Zhao, Kui ^{[1
]}

Liang, Gang ^{[1
]}

Li, Siyu ^{[1
]}

Wu, Yifei ^{[1
]}

Zhou, Yiping ^{[1
]}

机构：

[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 280卷

基金：

中国国家自然科学基金;

关键词：

Transformer; Medical image segmentation; Attention mechanism; NETWORKS;

D O I：

10.1016/j.knosys.2023.110987

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional neural networks(CNN), especially U-shaped networks, have become the mainstream approach for medical image segmentation. However, due to the intrinsic locality of convolutional operations, CNN has inherent limitations in capturing long-range dependencies. Although Transformer-based methods have demonstrated remarkable performance in computer vision by modeling long-range dependencies, their high computational complexity and reliance on large-scale pre-training present challenges, particularly for higher-resolution medical images. In this paper, we introduce MAXFormer, a U-shaped hierarchical network that effectively leverages global context within individual samples and relationships between different samples. Our Transformer module reformulates the self-attention mechanism into two parts: local-global attention and external attention. The local-global attention provides an efficient alternative to self-attention with linear complexity, employing a parallel architecture that allows local-global spatial interactions. The local attention branch captures high-frequency local information, while the global attention branch captures low-frequency global information. Furthermore, we have designed the Refined Fused Connection module to effectively merge feature outputs from each encoder block with the decoder output, mitigating spatial detail loss due to downsampling. Extensive experiments on two different medical image segmentation datasets show that our proposed method outperforms other state-of-the-art methods without requiring pre-training weights. Code will be available at https://github.com/zhiwei-liang/MAXFormer.

引用

页数：10

共 50 条

[1] MAFUNet: Multi-Attention Fusion Network for Medical Image Segmentation
Wang, Lili
Zhao, Jiayu
Yang, Hailu
[J]. IEEE ACCESS, 2023, 11 : 109793 - 109802
[2] MM-UNet: Multi-attention mechanism and multi-scale feature fusion UNet for tumor image segmentation
Xing, Yaozheng
Yuan, Jie
Liu, Qixun
Peng, Shihao
Yan, Yan
Yao, Junyi
[J]. 2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 253 - 257
[3] A Multi-scale and Multi-attention Network for Skin Lesion Segmentation
Wu, Cong
Zhang, Hang
Chen, Dingsheng
Gan, Haitao
[J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 537 - 550
[4] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
Rahman, Md Mostafijur
Marculescu, Radu
[J]. MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
[5] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
Sun, Qian
Zhao, Guangrui
Xia, Xinyuan
Xie, Yu
Fang, Chenrong
Sun, Le
Wu, Zebin
Pan, Chengsheng
[J]. REMOTE SENSING, 2024, 16 (12)
[6] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
Xu, Zhenghua
Tian, Biao
Liu, Shijie
Wang, Xiangtao
Yuan, Di
Gu, Junhua
Chen, Junyang
Lukasiewicz, Thomas
Leung, Victor C. M.
[J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1857 - 1871
[7] A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
Cui, Jianguo
Wang, Liejun
Jiang, Shaochen
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (19):
[8] MSGAT: Multi-scale gated axial reverse attention transformer network for medical image segmentation
Liu, Yanjun
Yun, Haijiao
Xia, Yang
Luan, Jinyang
Li, Mingjing
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
[9] Adaptive fusion with multi-scale features for interactive image segmentation
Zongyuan Ding
Tao Wang
Quansen Sun
Hongyuan Wang
[J]. Applied Intelligence, 2021, 51 : 5610 - 5621
[10] Adaptive fusion with multi-scale features for interactive image segmentation
Ding, Zongyuan
Wang, Tao
Sun, Quansen
Wang, Hongyuan
[J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5610 - 5621

← 1 2 3 4 5 →