CONVFORMER: COMBINING CNN AND TRANSFORMER FOR MEDICAL IMAGE SEGMENTATION

被引:1
|
作者
Gu, Pengfei [1 ]
Zhang, Yejia [1 ]
Wang, Chaoli [1 ]
Chen, Danny Z. [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
关键词
D O I
10.1109/ISBI53787.2023.10230838
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural network (CNN) based methods have achieved great successes in medical image segmentation, but their capability to learn global representations is still limited due to using small effective receptive fields of convolution operations. Transformer based methods are capable of modelling long-range dependencies of information for capturing global representations, yet their ability to model local context is lacking. Integrating CNN and Transformer to learn both local and global representations while exploring multi-scale features is instrumental in further improving medical image segmentation. In this paper, we propose a hierarchical CNN and Transformer hybrid architecture, called ConvFormer, for medical image segmentation. ConvFormer is based on several simple yet effective designs. (1) A feed forward module of Deformable Transformer (DeTrans) is re-designed to introduce local information, called Enhanced DeTrans. (2) A residual-shaped hybrid stem based on a combination of convolutions and Enhanced DeTrans is developed to capture both local and global representations to enhance representation ability. (3) Our encoder utilizes the residual-shaped hybrid stem in a hierarchical manner to generate feature maps in different scales, and an additional Enhanced DeTrans encoder with residual connections is built to exploit multi-scale features with feature maps of different scales as input. Experiments on several datasets show that our ConvFormer, trained from scratch, outperforms various CNN- or Transformerbased architectures, achieving state-of-the-art performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation
    Lin, Xian
    Yan, Zengqiang
    Deng, Xianbo
    Zheng, Chuansheng
    Yu, Li
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 642 - 651
  • [2] Combining frequency transformer and CNNs for medical image segmentation
    Ismayl Labbihi
    Othmane El Meslouhi
    Mohamed Benaddy
    Mustapha Kardouchi
    Moulay Akhloufi
    Multimedia Tools and Applications, 2024, 83 : 21197 - 21212
  • [3] Combining frequency transformer and CNNs for medical image segmentation
    Labbihi, Ismayl
    El Meslouhi, Othmane
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 21197 - 21212
  • [4] CTransCNN: Combining transformer and CNN in multilabel medical image classification
    Wu, Xin
    Feng, Yue
    Xu, Hong
    Lin, Zhuosheng
    Chen, Tao
    Li, Shengke
    Qiu, Shihan
    Liu, Qichao
    Ma, Yuangang
    Zhang, Shuangsheng
    KNOWLEDGE-BASED SYSTEMS, 2023, 281
  • [5] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [6] An effective CNN and Transformer complementary network for medical image segmentation
    Yuan, Feiniu
    Zhang, Zhengxiao
    Fang, Zhijun
    PATTERN RECOGNITION, 2023, 136
  • [7] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [8] From CNN to Transformer: A Review of Medical Image Segmentation Models
    Yao, Wenjian
    Bai, Jiajun
    Liao, Wei
    Chen, Yuheng
    Liu, Mengjuan
    Xie, Yao
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1529 - 1547
  • [9] FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation
    Jiang, Zhongchuan
    Wu, Yun
    Huang, Lei
    Gu, Maohua
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2024, 32 (04) : 931 - 951
  • [10] CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation
    Chen, Yuanbin
    Wang, Tao
    Tang, Hui
    Zhao, Longxuan
    Zhang, Xinlin
    Tan, Tao
    Gao, Qinquan
    Du, Min
    Tong, Tong
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17):