CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

被引:0
|
作者
Lei, Tao [1 ,2 ]
Sun, Rui [1 ]
Wang, Xuan [3 ]
Wang, Yingbo [1 ]
He, Xi [1 ]
Nandi, Asoke [4 ]
机构
[1] Shaanxi Univ Sci & Technol, Shaanxi Joint Lab Artificial Intelligence, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Affiliated Hosp 1, Dept Geriatr Surg, Xian, Peoples R China
[3] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian, Peoples R China
[4] Brunel Univ London, Department Elect & Elect Engn, Uxbridge, England
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The hybrid architecture of convolutional neural networks (CNNs) and Transformer are very popular for medical image segmentation. However, it suffers from two challenges. First, although a CNNs branch can capture the local image features using vanilla convolution, it cannot achieve adaptive feature learning. Second, although a Transformer branch can capture the global features, it ignores the channel and cross-dimensional self-attention, resulting in a low segmentation accuracy on complex-content images. To address these challenges, we propose a novel hybrid architecture of convolutional neural networks hand in hand with vision Transformers (CiT-Net) for medical image segmentation. Our network has two advantages. First, we design a dynamic deformable convolution and apply it to the CNNs branch, which over-comes the weak feature extraction ability due to fixed-size convolution kernels and the stiff design of sharing kernel parameters among different in-puts. Second, we design a shifted-window adaptive complementary attention module and a compact convolutional projection. We apply them to the Transformer branch to learn the cross-dimensional long-term dependency for medical images. Experimental results show that our CiT-Net provides better medical image segmentation results than popular SOTA methods. Besides, our CiT-Net requires lower parameters and less computational costs and does not rely on pre-training. The code is publicly available at https://github.com/SR0920/CiT-Net.
引用
收藏
页码:1017 / 1025
页数:9
相关论文
共 50 条
  • [1] Development and Research of Hand Segmentation Algorithms on the Image Based on Convolutional Neural Networks
    Kazdorf, S. Ya.
    Pershina, Zh. S.
    Kolker, A. B.
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL SYMPOSIUM INTELLIGENT SYSTEMS 2018 (INTELS'18), 2019, 150 : 450 - 454
  • [2] Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review
    Takahashi, Satoshi
    Sakaguchi, Yusuke
    Kouno, Nobuji
    Takasawa, Ken
    Ishizu, Kenichi
    Akagi, Yu
    Aoyama, Rina
    Teraya, Naoki
    Bolatkan, Amina
    Shinkai, Norio
    Machino, Hidenori
    Kobayashi, Kazuma
    Asada, Ken
    Komatsu, Masaaki
    Kaneko, Syuzo
    Sugiyama, Masashi
    Hamamoto, Ryuji
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2024, 48 (01)
  • [3] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhang, Zhixin
    Jiang, Shuhao
    Pan, Xuhua
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2265 - 2275
  • [4] Ctnet: rethinking convolutional neural networks and vision transformer for medical image segmentation
    Zhixin Zhang
    Shuhao Jiang
    Xuhua Pan
    [J]. Signal, Image and Video Processing, 2024, 18 : 2265 - 2275
  • [5] V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
    Milletari, Fausto
    Navab, Nassir
    Ahmadi, Seyed-Ahmad
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 565 - 571
  • [6] CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation
    Gu, Ran
    Wang, Guotai
    Song, Tao
    Huang, Rui
    Aertsen, Michael
    Deprest, Jan
    Ourselin, Sebastien
    Vercauteren, Tom
    Zhang, Shaoting
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 699 - 711
  • [7] Comparing Vision Transformers and Convolutional Neural Networks for Image Classification: A Literature Review
    Mauricio, Jose
    Domingues, Ines
    Bernardino, Jorge
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [8] THE EFFECT OF PREPROCESSING ON CONVOLUTIONAL NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
    de Raad, K. B.
    van Garderen, K. A.
    Smits, M.
    van der Voort, S. R.
    Incekara, F.
    Oei, E. H. G.
    Hirvasniemi, J.
    Klein, S.
    Starmans, M. P. A.
    [J]. 2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 655 - 658
  • [9] EEvoU-Net: An ensemble of evolutionary deep fully convolutional neural networks for medical image segmentation
    Hassanzadeh, Tahereh
    Essam, Daryl
    Sarker, Ruhul
    [J]. APPLIED SOFT COMPUTING, 2023, 143
  • [10] Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Qinlan
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164