HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation

Cited by: 2
Authors
Yang, Fan [1]
Wang, Fan [1]
Dong, Pengwei [1]
Wang, Bo [1]
Affiliations
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Convolutional Neural Network; Transformers; 3D medical image segmentation; Double-Former Block; Multi-scale and multi-channel feature information; AXIAL-ATTENTION; NET;
DOI
10.1016/j.bspc.2023.105834
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
In recent years, Transformers have achieved success in medical image segmentation owing to their outstanding capability to model long-range dependencies. However, many existing segmentation methods use the Transformer only as an auxiliary module to capture global context information in images, which limits its potential. Additionally, the self-attention mechanism within Transformers can lead to attention collapse, which creates a semantic gap between the encoder and decoder. Furthermore, most networks have difficulty handling multi-scale and multi-channel feature information effectively. To address these problems, we propose HCA-Former, a hybrid method for medical image segmentation that combines Convolutional Neural Networks (CNNs) and Transformers. We design a local multi-channel attention (LMCA) block to effectively combine the features of CNNs and Transformers, enabling multi-channel information extraction and interaction. The Double-Former Block (DFB) alleviates the semantic gap between the encoder and decoder and restores more detailed information. Moreover, the global multi-scale attention (GMSA) block establishes information interaction among multi-scale targets, thereby enhancing the generalization capability of the model. To validate the effectiveness of our approach, we evaluate the proposed method on three challenging tasks: the MICCAI 2015 Multi-Atlas Abdomen Labeling Challenge (Synapse), the Automated Cardiac Diagnosis Challenge (ACDC), and the Medical Segmentation Decathlon Brain Tumor Segmentation task (MSD brain tumor). Extensive experiments demonstrate that HCA-Former achieves performance competitive with or better than state-of-the-art approaches for 3D medical image segmentation.
Pages: 15