PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion

被引:0
|
作者
Lijin, P. [1 ]
Ullah, Mohib [2 ]
Vats, Anuja [2 ]
Cheikh, Faouzi Alaya [3 ]
Kumar, G. Santhosh [1 ]
Nair, Madhu S. [1 ]
机构
[1] Cochin Univ Sci & Technol, Dept Comp Sci, Artificial Intelligence & Comp Vis Lab, Kochi 682022, Kerala, India
[2] Norwegian Univ Sci & Technol, Teknol Vegen 22, N-2815 Gjovik, Norway
[3] Norwegian Univ Sci & Technol, Norwegian Colour & Visual Comp Lab, Teknol Vegen 22, N-2815 Gjovik, Norway
关键词
Swin transformer; Vision transformer; Convolutional neural network; Colorectal cancer; Segmentation;
D O I
10.1007/s13534-024-00415-x
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Colorectal cancer ranks as the second most prevalent cancer worldwide, with a high mortality rate. Colonoscopy stands as the preferred procedure for diagnosing colorectal cancer. Detecting polyps at an early stage is critical for effective prevention and diagnosis. However, challenges in colonoscopic procedures often lead medical practitioners to seek support from alternative techniques for timely polyp identification. Polyp segmentation emerges as a promising approach to identify polyps in colonoscopy images. In this paper, we propose an advanced method, PolySegNet, that leverages both Vision Transformer and Swin Transformer, coupled with a Convolutional Neural Network (CNN) decoder. The fusion of these models facilitates a comprehensive analysis of various modules in our proposed architecture.To assess the performance of PolySegNet, we evaluate it on three colonoscopy datasets, a combined dataset, and their augmented versions. The experimental results demonstrate that PolySegNet achieves competitive results in terms of polyp segmentation accuracy and efficacy, achieving a mean Dice score of 0.92 and a mean Intersection over Union (IoU) of 0.86. These metrics highlight the superior performance of PolySegNet in accurately delineating polyp boundaries compared to existing methods. PolySegNet has shown great promise in accurately and efficiently segmenting polyps in medical images. The proposed method could be the foundation for a new class of transformer-based segmentation models in medical image analysis.
引用
收藏
页码:1421 / 1431
页数:11
相关论文
共 50 条
  • [1] Colorectal polyp segmentation method based on the Swin Transformer and graph reasoning
    Liang L.
    He A.
    Yang Y.
    Wu J.
    [J]. Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (05): : 897 - 907
  • [2] Attention combined pyramid vision transformer for polyp segmentation
    Liu, Xiaogang
    Song, Shuang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [3] Polyp2Seg: Improved Polyp Segmentation with Vision Transformer
    Mandujano-Cornejo, Vittorino
    Montoya-Zegarra, Javier A.
    [J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 519 - 534
  • [4] Improving the Performance of Human Part Segmentation Based on Swin Transformer
    Du, Juan
    Yang, Tao
    [J]. OPTICAL MEMORY AND NEURAL NETWORKS, 2023, 32 (02) : 101 - 107
  • [5] Improving the Performance of Human Part Segmentation Based on Swin Transformer
    [J]. Optical Memory and Neural Networks, 2023, 32 : 101 - 107
  • [6] FCN-Transformer Feature Fusion for Polyp Segmentation
    Sanderson, Edward
    Matuszewski, Bogdan J.
    [J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 892 - 907
  • [7] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [8] A fusion-attention swin transformer for cardiac MRI image segmentation
    Yang, Ruiping
    Liu, Kun
    Liang, Yongquan
    [J]. IET IMAGE PROCESSING, 2024, 18 (01) : 105 - 115
  • [9] Probabilistic Modeling Ensemble Vision Transformer Improves Complex Polyp Segmentation
    Ling, Tianyi
    Wu, Chengyi
    Yu, Huan
    Cai, Tian
    Wang, Da
    Zhou, Yincong
    Chen, Ming
    Ding, Kefeng
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII, 2023, 14226 : 572 - 581
  • [10] Colorectal Polyp Segmentation Combining Pyramid Vision Transformer and Axial Attention
    Zhou, Xue
    Bai, Zhengyao
    Lu, Qianjie
    Fan, Shenglan
    [J]. Computer Engineering and Applications, 2023, 59 (11) : 222 - 230