SwinE-Net: hybrid deep learning approach to novel polyp segmentation using convolutional neural network and Swin Transformer

被引:69
|
作者
Park, Kyeong-Beom [1 ]
Lee, Jae Yeol [1 ]
机构
[1] Chonnam Natl Univ, Dept Ind Engn, 77,Yongbong Ro, Gwangju 61186, South Korea
基金
新加坡国家研究基金会;
关键词
polyp segmentation; convolutional neural networks; multidilation convolutional block; multifeature aggregation block; Swin Transformer; Vision Transformer; COLONOSCOPY;
D O I
10.1093/jcde/qwac018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prevention of colorectal cancer (CRC) by inspecting and removing colorectal polyps has become a global health priority because CRC is one of the most frequent cancers in the world. Although recent U-Net-based convolutional neural networks (CNNs) with deep feature representation and skip connections have shown to segment polyps effectively, U-Net-based approaches still have limitations in modeling explicit global contexts, due to the intrinsic nature locality of convolutional operations. To overcome these problems, this study proposes a novel deep learning model, SwinE-Net, for polyp segmentation that effectively combines a CNN-based EfficientNet and Vision Transformer (ViT)-based Swin Ttransformer. The main challenge is to conduct accurate and robust medical segmentation in maintaining global semantics without sacrificing low-level features of CNNs through Swin Transformer. First, the multidilation convolutional block generates refined feature maps to enhance feature discriminability for multilevel feature maps extracted from CNN and ViT. Then, the multifeature aggregation block creates intermediate side outputs from the refined polyp features for efficient training. Finally, the attentive deconvolutional network-based decoder upsamples the refined and combined feature maps to accurately segment colorectal polyps. We compared the proposed approach with previous state-of-the-art methods by evaluating various metrics using five public datasets (Kvasir, ClinicDB, ColonDB, ETIS, and EndoScene). The comparative evaluation, in particular, proved that the proposed approach showed much better performance in the unseen dataset, which shows the generalization and scalability in conducting polyp segmentation. Furthermore, an ablation study was performed to prove the novelty and advantage of the proposed network. The proposed approach outperformed previous studies.
引用
收藏
页码:616 / 632
页数:17
相关论文
共 50 条
  • [41] Swin-CFNet: An Attempt at Fine-Grained Urban Green Space Classification Using Swin Transformer and Convolutional Neural Network
    Wu, Yehong
    Zhang, Meng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [42] Seismic Impedance Inversion Using a Joint Deep Learning Model Based on Convolutional Neural Network and Transformer
    Fu, Jingcheng
    Fan, Rui
    Cao, Junxing
    Zhang, Xin
    Shi, Shaochen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 8913 - 8922
  • [43] Deep Learning Based Automatic Liver Volume Estimation and Segmentation via U-net Convolutional Neural Network
    Marlatt, B.
    Pettit, R.
    Havelka, J.
    Corr, S. J.
    Rana, A.
    AMERICAN JOURNAL OF TRANSPLANTATION, 2021, 21 : 797 - 797
  • [44] Evolution of Image Segmentation using Deep Convolutional Neural Network: A Survey
    Sultana, Farhana
    Sufian, Abu
    Dutta, Paramartha
    KNOWLEDGE-BASED SYSTEMS, 2020, 201 (201-202)
  • [45] Segmentation of glioma tumors in brain using deep convolutional neural network
    Hussain, Saddam
    Anwar, Syed Muhammad
    Majid, Muhammad
    NEUROCOMPUTING, 2018, 282 : 248 - 261
  • [46] Image Segmentation of Salt Deposits Using Deep Convolutional Neural Network
    Liu, Bo
    Jing, Haipeng
    Li, Jianqiang
    Li, Yong
    Qu, Guangzhi
    Gu, Rentao
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3304 - 3309
  • [47] Side Scan Sonar Segmentation Using Deep Convolutional Neural Network
    Song, Yan
    Zhu, Yuemei
    Li, Guangliang
    Feng, Chen
    He, Bo
    Yan, Tianhong
    OCEANS 2017 - ANCHORAGE, 2017,
  • [48] Brain Tumor Segmentation using Cascaded Deep Convolutional Neural Network
    Hussain, Saddam
    Anwar, Syed Muhammad
    Majid, Muhammad
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 1998 - 2001
  • [49] CrackW-Net: A Novel Pavement Crack Image Segmentation Convolutional Neural Network
    Han, Chengjia
    Ma, Tao
    Huyan, Ju
    Huang, Xiaoming
    Zhang, Yanning
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 22135 - 22144
  • [50] Deep learning-based Raman spectroscopy qualitative analysis algorithm: A convolutional neural network and transformer approach
    Wang, Zilong
    Li, Yunfeng
    Zhai, Jinglei
    Yang, Siwei
    Sun, Biao
    Liang, Pei
    TALANTA, 2024, 275