CoST-UNet: Convolution and swin transformer based deep learning architecture for cardiac segmentation

Cited by: 0
Authors
Islam, Md Rabiul [1]
Qaraqe, Marwa [2]
Serpedin, Erchin [1]
Affiliations
[1] Texas A&M Univ, Elect & Comp Engn, College Stn, TX 77843 USA
[2] Hamad Bin Khalifa Univ, Coll Sci & Engn, Informat & Comp Technol, Doha, Qatar
Keywords
Segmentation; Echocardiogram; Vision transformer; CNN-transformer; Local-global; INDEX;
DOI
10.1016/j.bspc.2024.106633
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Automatic segmentation of two-dimensional (2D) echocardiograms is beneficial for heart disease diagnosis and assessment. Convolutional Neural Network (CNN) based U-shaped architectures such as UNet have shown remarkable success in medical image segmentation. However, UNet is generally limited in capturing long-range dependencies because of the intrinsic locality of the convolution operation. Transformer models, in contrast, can capture global-level information using the multi-head attention mechanism. Taken separately, these models exhibit limited localization abilities due to insufficient low-level details. To overcome these limitations, this paper proposes CoST-UNet (Convolution and Swin Transformer-based U-shaped Network), a novel vision transformer architecture that uses CNN layers to leverage spatial information from images in the upper layers and transformer blocks to emphasize global contextual information in the deeper levels. Unlike existing hybrid models such as TransUNet and UNETR, the transformer block of the proposed model employs a Swin Transformer backbone, which ensures linear computational complexity relative to image size. Furthermore, the primary barrier to improving transformer performance, namely the scarcity of medical images, is addressed by incorporating two convolution layers at the network's uppermost level. Experimental results show that the model achieves state-of-the-art performance on the ultrasound-based CAMUS dataset, with mean Dice Similarity Coefficients of 0.925, 0.851, and 0.895 for segmenting LV_endo, LV_epi, and LA, respectively, from apical 4CH echocardiograms, as well as competitive results on the MRI-based ACDC dataset, owing to its effective capture of local and global context.
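For readers unfamiliar with the metric reported in the abstract, the Dice Similarity Coefficient for a predicted mask P and a ground-truth mask T is commonly defined as 2|P ∩ T| / (|P| + |T|). The following is a minimal sketch of that standard definition in plain Python. It is not the authors' evaluation code: the function name `dice_coefficient`, the nested-list mask format, the empty-mask convention, and the toy masks are all assumptions made for illustration.

```python
def dice_coefficient(pred, target):
    """Dice similarity coefficient between two binary masks (nested lists of 0/1)."""
    # Flatten both masks and coerce every pixel to 0 or 1.
    pred_flat = [int(bool(v)) for row in pred for v in row]
    target_flat = [int(bool(v)) for row in target for v in row]
    if len(pred_flat) != len(target_flat):
        raise ValueError("masks must have the same number of pixels")
    # Overlap: pixels labelled foreground in both masks.
    intersection = sum(p & t for p, t in zip(pred_flat, target_flat))
    total = sum(pred_flat) + sum(target_flat)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Toy 3x3 masks, invented for illustration only (not data from the paper).
pred = [[0, 1, 1],
        [0, 1, 1],
        [0, 0, 0]]
target = [[0, 1, 1],
          [0, 1, 0],
          [0, 0, 0]]
score = dice_coefficient(pred, target)  # 2 * 3 / (4 + 3)
```

A score of 1.0 means the two masks coincide exactly and 0.0 means they share no foreground pixels. A mean score such as those quoted above would typically be an average of per-image scores for one structure, although the abstract does not state the exact averaging protocol.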
Pages: 17