An effective CNN and Transformer complementary network for medical image segmentation

被引:150
|
作者
Yuan, Feiniu [1 ,3 ,4 ]
Zhang, Zhengxiao [1 ,3 ,4 ]
Fang, Zhijun [2 ]
机构
[1] Shanghai Normal Univ SHNU, Coll Informat Mech & Elect Engn, Shanghai 201418, Peoples R China
[2] Donghua Univ, Sch Comp Sci & Technol, Shanghai 201620, Peoples R China
[3] Shanghai Normal Univ, Res Base Online Educ Shanghai Middle & Primary Sch, Shanghai 201418, Peoples R China
[4] Shanghai Normal Univ, Shanghai Engn Res Ctr Intelligent Educ & Bigdata, Shanghai 200234, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; Medical image segmentation; Feature complementary module; Cross -domain fusion; Convolutional Neural Network; ATTENTION;
D O I
10.1016/j.patcog.2022.109228
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Transformer network was originally proposed for natural language processing. Due to its powerful representation ability for long-range dependency, it has been extended for vision tasks in recent years. To fully utilize the advantages of Transformers and Convolutional Neural Networks (CNNs), we propose a CNN and Transformer Complementary Network (CTC -Net) for medical image segmentation. We first de-sign two encoders by Swin Transformers and Residual CNNs to produce complementary features in Trans-former and CNN domains, respectively. Then we cross-wisely concatenate these complementary features to propose a Cross-domain Fusion Block (CFB) for effectively blending them. In addition, we compute the correlation between features from the CNN and Transformer domains, and apply channel attention to the self-attention features by Transformers for capturing dual attention information. We incorporate cross-domain fusion, feature correlation and dual attention together to propose a Feature Complementary Module (FCM) for improving the representation ability of features. Finally, we design a Swin Transformer decoder to further improve the representation ability of long-range dependencies, and propose to use skip connections between the Transformer decoded features and the complementary features for extract-ing spatial details, contextual semantics and long-range information. Skip connections are performed in different levels for enhancing multi-scale invariance. Experimental results show that our CTC -Net signifi-cantly surpasses the state-of-the-art image segmentation models based on CNNs, Transformers, and even Transformer and CNN combined models designed for medical image segmentation. It achieves superior performance on different medical applications, including multi-organ segmentation and cardiac segmen-tation. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation
    Li, Zihan
    Li, Dihan
    Xu, Cangbai
    Wang, Weice
    Hong, Qingqi
    Li, Qingde
    Tian, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 781 - 792
  • [2] CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION
    Gu, Pengfei
    Zhang, Yejia
    Wang, Chaoli
    Chen, Danny Z.
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [3] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [4] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [5] From CNN to Transformer: A Review of Medical Image Segmentation Models
    Yao, Wenjian
    Bai, Jiajun
    Liao, Wei
    Chen, Yuheng
    Liu, Mengjuan
    Xie, Yao
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1529 - 1547
  • [6] CATS: COMPLEMENTARY CNN AND TRANSFORMER ENCODERS FOR SEGMENTATION
    Li, Hao
    Hu, Dewei
    Liu, Han
    Wang, Jiacheng
    Oguz, Ipek
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [7] CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation
    Chen, Yuanbin
    Wang, Tao
    Tang, Hui
    Zhao, Longxuan
    Zhang, Xinlin
    Tan, Tao
    Gao, Qinquan
    Du, Min
    Tong, Tong
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17):
  • [9] FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation
    Jiang, Zhongchuan
    Wu, Yun
    Huang, Lei
    Gu, Maohua
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2024, 32 (04) : 931 - 951
  • [10] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387