An effective CNN and Transformer complementary network for medical image segmentation

被引:150
|
作者
Yuan, Feiniu [1 ,3 ,4 ]
Zhang, Zhengxiao [1 ,3 ,4 ]
Fang, Zhijun [2 ]
机构
[1] Shanghai Normal Univ SHNU, Coll Informat Mech & Elect Engn, Shanghai 201418, Peoples R China
[2] Donghua Univ, Sch Comp Sci & Technol, Shanghai 201620, Peoples R China
[3] Shanghai Normal Univ, Res Base Online Educ Shanghai Middle & Primary Sch, Shanghai 201418, Peoples R China
[4] Shanghai Normal Univ, Shanghai Engn Res Ctr Intelligent Educ & Bigdata, Shanghai 200234, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; Medical image segmentation; Feature complementary module; Cross -domain fusion; Convolutional Neural Network; ATTENTION;
D O I
10.1016/j.patcog.2022.109228
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Transformer network was originally proposed for natural language processing. Due to its powerful representation ability for long-range dependency, it has been extended for vision tasks in recent years. To fully utilize the advantages of Transformers and Convolutional Neural Networks (CNNs), we propose a CNN and Transformer Complementary Network (CTC -Net) for medical image segmentation. We first de-sign two encoders by Swin Transformers and Residual CNNs to produce complementary features in Trans-former and CNN domains, respectively. Then we cross-wisely concatenate these complementary features to propose a Cross-domain Fusion Block (CFB) for effectively blending them. In addition, we compute the correlation between features from the CNN and Transformer domains, and apply channel attention to the self-attention features by Transformers for capturing dual attention information. We incorporate cross-domain fusion, feature correlation and dual attention together to propose a Feature Complementary Module (FCM) for improving the representation ability of features. Finally, we design a Swin Transformer decoder to further improve the representation ability of long-range dependencies, and propose to use skip connections between the Transformer decoded features and the complementary features for extract-ing spatial details, contextual semantics and long-range information. Skip connections are performed in different levels for enhancing multi-scale invariance. Experimental results show that our CTC -Net signifi-cantly surpasses the state-of-the-art image segmentation models based on CNNs, Transformers, and even Transformer and CNN combined models designed for medical image segmentation. It achieves superior performance on different medical applications, including multi-organ segmentation and cardiac segmen-tation. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Lin Zhang
    Xinyu Guo
    Hongkun Sun
    Weigang Wang
    Liwei Yao
    Scientific Reports, 15 (1)
  • [22] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [23] FTransCNN: Fusing Transformer and a CNN based on fuzzy logic for uncertain medical image segmentation
    Ding, Weiping
    Wang, Haipeng
    Huang, Jiashuang
    Ju, Hengrong
    Geng, Yu
    Lin, Chin-Teng
    Pedrycz, Witold
    INFORMATION FUSION, 2023, 99
  • [24] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [25] FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation
    Xu, Shoukun
    Xiao, Dehao
    Yuan, Baohua
    Liu, Yi
    Wang, Xueyuan
    Li, Ning
    Shi, Lin
    Chen, Jialu
    Zhang, Ju-Xiao
    Wang, Yanhao
    Cao, Jianfeng
    Shao, Yeqin
    Jiang, Mingjie
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [26] DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation
    Tang, Feilong
    Xu, Zhongxing
    Huang, Qiming
    Wang, Jinfeng
    Hou, Xianxu
    Su, Jionglong
    Liu, Jingxin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 343 - 356
  • [27] A hybrid enhanced attention transformer network for medical ultrasound image segmentation
    Jiang, Tao
    Xing, Wenyu
    Yu, Ming
    Ta, Dean
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [28] CCTrans: Improving Medical Image Segmentation with Contoured Convolutional Transformer Network
    Wang, Jingling
    Zhang, Haixian
    Yi, Zhang
    MATHEMATICS, 2023, 11 (09)
  • [29] Laplacian-guided hierarchical transformer: A network for medical image segmentation
    Chen, Yuxiao
    Su, Diwei
    Luo, Jianxu
    Computer Methods and Programs in Biomedicine, 2025, 260
  • [30] Swin Transformer Assisted Prior Attention Network for Medical Image Segmentation
    Liao, Zhihao
    Fan, Neng
    Xu, Kai
    APPLIED SCIENCES-BASEL, 2022, 12 (09):