SwinE-Net: hybrid deep learning approach to novel polyp segmentation using convolutional neural network and Swin Transformer

被引:69
|
作者
Park, Kyeong-Beom [1 ]
Lee, Jae Yeol [1 ]
机构
[1] Chonnam Natl Univ, Dept Ind Engn, 77,Yongbong Ro, Gwangju 61186, South Korea
基金
新加坡国家研究基金会;
关键词
polyp segmentation; convolutional neural networks; multidilation convolutional block; multifeature aggregation block; Swin Transformer; Vision Transformer; COLONOSCOPY;
D O I
10.1093/jcde/qwac018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prevention of colorectal cancer (CRC) by inspecting and removing colorectal polyps has become a global health priority because CRC is one of the most frequent cancers in the world. Although recent U-Net-based convolutional neural networks (CNNs) with deep feature representation and skip connections have shown to segment polyps effectively, U-Net-based approaches still have limitations in modeling explicit global contexts, due to the intrinsic nature locality of convolutional operations. To overcome these problems, this study proposes a novel deep learning model, SwinE-Net, for polyp segmentation that effectively combines a CNN-based EfficientNet and Vision Transformer (ViT)-based Swin Ttransformer. The main challenge is to conduct accurate and robust medical segmentation in maintaining global semantics without sacrificing low-level features of CNNs through Swin Transformer. First, the multidilation convolutional block generates refined feature maps to enhance feature discriminability for multilevel feature maps extracted from CNN and ViT. Then, the multifeature aggregation block creates intermediate side outputs from the refined polyp features for efficient training. Finally, the attentive deconvolutional network-based decoder upsamples the refined and combined feature maps to accurately segment colorectal polyps. We compared the proposed approach with previous state-of-the-art methods by evaluating various metrics using five public datasets (Kvasir, ClinicDB, ColonDB, ETIS, and EndoScene). The comparative evaluation, in particular, proved that the proposed approach showed much better performance in the unseen dataset, which shows the generalization and scalability in conducting polyp segmentation. Furthermore, an ablation study was performed to prove the novelty and advantage of the proposed network. The proposed approach outperformed previous studies.
引用
收藏
页码:616 / 632
页数:17
相关论文
共 50 条
  • [1] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhang, Zhuo
    Wu, Hongbing
    Zhao, Huan
    Shi, Yicheng
    Wang, Jifang
    Bai, Hua
    Sun, Baoshan
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (04) : 663 - 677
  • [2] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhuo Zhang
    Hongbing Wu
    Huan Zhao
    Yicheng Shi
    Jifang Wang
    Hua Bai
    Baoshan Sun
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 663 - 677
  • [3] Hybrid semantic segmentation for tunnel lining cracks based on Swin Transformer and convolutional neural network
    Zhou, Zhong
    Zhang, Junjie
    Gong, Chenjie
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (17) : 2491 - 2510
  • [4] Vessels Segmentation in Angiograms Using Convolutional Neural Network: A Deep Learning Based Approach
    Roy, Sanjiban Sekhar
    Hsu, Ching-Hsien
    Samaran, Akash
    Goyal, Ranjan
    Pande, Arindam
    Balas, Valentina E.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (01): : 241 - 255
  • [5] CoVi-Net: A hybrid convolutional and vision transformer neural network for retinal vessel segmentation
    Jiang, Minshan
    Zhu, Yongfei
    Zhang, Xuedian
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [6] A Hybrid Network Based on nnU-Net and Swin Transformer for Kidney Tumor Segmentation
    Qian, Lifei
    Luo, Ling
    Zhong, Yuanhong
    Zhong, Daidi
    KIDNEY AND KIDNEY TUMOR SEGMENTATION, KITS 2023, 2024, 14540 : 30 - 39
  • [7] A novel hybrid face mask detection approach using Transformer and convolutional neural network models
    Al-Sarrar, Haifa M.
    Al-Baity, Heyam H.
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [8] A novel hybrid face mask detection approach using Transformer and convolutional neural network models
    Al-Sarrar H.M.
    Al-Baity H.H.
    PeerJ Computer Science, 2023, 9
  • [9] Colorectal Polyp Segmentation Using A Fully Convolutional Neural Network
    Li, Qiaoliang
    Yang, Guangyao
    Chen, Zhewei
    Huang, Bin
    Chen, Liangliang
    Xu, Depeng
    Zhou, Xueying
    Zhong, Shi
    Zhang, Huisheng
    Wang, Tianfu
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [10] A Deep Learning Approach for Brain Tumor Classification and Segmentation Using a Multiscale Convolutional Neural Network
    Diaz-Pernas, Francisco Javier
    Martinez-Zarzuela, Mario
    Anton-Rodriguez, Miriam
    Gonzalez-Ortega, David
    HEALTHCARE, 2021, 9 (02)