HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation

被引:2
|
作者
Yang, Fan [1 ]
Wang, Fan [1 ]
Dong, Pengwei [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional Neural Network; Transformers; 3D medical image segmentation; Double-Former Block; Multi-scale and multi-channel feature information; AXIAL-ATTENTION; NET;
D O I
10.1016/j.bspc.2023.105834
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In recent years, Transformers have achieved success in the field of medical image segmentation due to their outstanding capability to model long-range dependencies. However, many existing segmentation methods only use Transformer as an auxiliary module to capture global context information in images, limiting the potential of the Transformers. Additionally, self-attention mechanism within the Transformers can lead to attention collapse issues, thus triggering semantic gap between the encoder and decoder. Furthermore, most networks have dif-ficulties in effectively handling multi-scale and multi-channel feature information. To address the above prob-lems, we propose a hybrid Convolutional Neural Networks (CNNs) and Transformers method for medical image segmentation (HCA-Former). We design a local multi-channel attention block (LMCA) to effectively combine the features of CNN and Transformers, enabling multi-channel information extraction and interaction. Using the Double-Former Block (DFB) alleviates the semantic gap between the encoder and decoder, restoring more detailed information. Moreover, the utilization of the global multi-scale attention block (GMSA) can establish information interaction among multi-scale targets, thereby enhancing generalization capability of the model. To validate the effectiveness of our approach, we evaluate the proposed method on three challenging tasks: the MICCAI 2015 Multi-Image Abdominal Marker Challenge (Synapse), Automated Cardiac Diagnosis Dataset (ACDC), and Medical Segmentation Decathlon Brain Tumor Segmentation (MSD brain tumor). Extensive ex-periments demonstrate that our HCA-Former achieved competitive or better performance than state-of-the-art approaches for 3D medical image segmentation.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [2] FATUnetr:fully attention Transformer for 3D medical image segmentation
    Li, QingFeng
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1415 - 1419
  • [3] A 3D Medical Image Segmentation Framework Fusing Convolution and Transformer Features
    Zhu, Fazhan
    Lv, Jiaxing
    Lu, Kun
    Wang, Wenyan
    Cong, Hongshou
    Zhang, Jun
    Chen, Peng
    Zhao, Yuan
    Wu, Ziheng
    INTELLIGENT COMPUTING THEORIES AND APPLICATION (ICIC 2022), PT I, 2022, 13393 : 772 - 786
  • [4] VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation
    Liu, Tiange
    Bai, Qingze
    Torigian, Drew A.
    Tong, Yubing
    Udupa, Jayaram K.
    MEDICAL IMAGE ANALYSIS, 2024, 98
  • [5] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Wu, Yixuan
    Liao, Kuanlun
    Chen, Jintai
    Wang, Jinhong
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1931 - 1944
  • [6] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Yixuan Wu
    Kuanlun Liao
    Jintai Chen
    Jinhong Wang
    Danny Z. Chen
    Honghao Gao
    Jian Wu
    Neural Computing and Applications, 2023, 35 : 1931 - 1944
  • [7] Hybrid 3D Medical Image Segmentation Using CNN and Frequency Transformer Fusion
    Labbihi, Ismayl
    Meslouhi, Othmane El
    Elassad, Zouhair Elamrani Abou
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [8] A hybrid framework for 3D medical image segmentation
    Chen, T
    Metaxas, D
    MEDICAL IMAGE ANALYSIS, 2005, 9 (06) : 547 - 565
  • [9] Volumetric Attention for 3D Medical Image Segmentation and Detection
    Wang, Xudong
    Han, Shizhong
    Chen, Yunqiang
    Gao, Dashan
    Vasconcelos, Nuno
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 175 - 184
  • [10] TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation
    Li, Zheng
    Zhang, Jinhui
    Wei, Siyi
    Gao, Yueyang
    Cao, Chengwei
    Wu, Zhiwei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6803 - 6814