SwinE-Net: hybrid deep learning approach to novel polyp segmentation using convolutional neural network and Swin Transformer

Cited by: 69
Authors
Park, Kyeong-Beom [1]
Lee, Jae Yeol [1]
Affiliations
[1] Chonnam Natl Univ, Dept Ind Engn, 77,Yongbong Ro, Gwangju 61186, South Korea
Funding
National Research Foundation of Singapore;
Keywords
polyp segmentation; convolutional neural networks; multidilation convolutional block; multifeature aggregation block; Swin Transformer; Vision Transformer; COLONOSCOPY;
DOI
10.1093/jcde/qwac018
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Prevention of colorectal cancer (CRC) by inspecting and removing colorectal polyps has become a global health priority because CRC is one of the most frequent cancers in the world. Although recent U-Net-based convolutional neural networks (CNNs) with deep feature representation and skip connections have been shown to segment polyps effectively, U-Net-based approaches are still limited in modeling explicit global contexts because of the intrinsic locality of convolutional operations. To overcome these problems, this study proposes a novel deep learning model, SwinE-Net, for polyp segmentation that effectively combines a CNN-based EfficientNet and a Vision Transformer (ViT)-based Swin Transformer. The main challenge is to conduct accurate and robust medical segmentation while maintaining global semantics through the Swin Transformer without sacrificing the low-level features of CNNs. First, the multidilation convolutional block generates refined feature maps to enhance feature discriminability for multilevel feature maps extracted from the CNN and ViT. Then, the multifeature aggregation block creates intermediate side outputs from the refined polyp features for efficient training. Finally, the attentive deconvolutional network-based decoder upsamples the refined and combined feature maps to accurately segment colorectal polyps. We compared the proposed approach with previous state-of-the-art methods by evaluating various metrics on five public datasets (Kvasir, ClinicDB, ColonDB, ETIS, and EndoScene). In particular, the comparative evaluation showed that the proposed approach performed much better on the unseen datasets, which demonstrates its generalization and scalability in polyp segmentation. Furthermore, an ablation study was performed to demonstrate the novelty and advantage of the proposed network. The proposed approach outperformed previous studies.
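The abstract names a multidilation convolutional block but does not give its layout. The sketch below is only a toy illustration of the general idea behind such blocks: parallel dilated convolutions with different rates, fused into one output. It works on 1-D lists in plain Python. The function names, the dilation rates (1, 2, 3), and the averaging fusion are all assumptions made for illustration and are not taken from the paper.

```python
def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1-D cross-correlation with a dilation rate.

    Assumes an odd kernel length so the kernel can be centered.
    """
    half = (len(kernel) // 2) * dilation  # padding needed on each side
    padded = [0.0] * half + list(signal) + [0.0] * half
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # Kernel taps are spaced `dilation` samples apart.
            acc += w * padded[i + j * dilation]
        out.append(acc)
    return out


def multidilation_block(signal, kernel, dilations=(1, 2, 3)):
    """Hypothetical fusion: run parallel dilated branches and average them."""
    branches = [dilated_conv1d(signal, kernel, d) for d in dilations]
    return [sum(vals) / len(branches) for vals in zip(*branches)]


def receptive_field(kernel_size, dilation):
    """Input span covered by one output value of a dilated kernel."""
    return dilation * (kernel_size - 1) + 1


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    ones = [1, 1, 1]
    for d in (1, 2, 3):
        print(d, receptive_field(len(ones), d), dilated_conv1d(impulse, ones, d))
    print(multidilation_block(impulse, ones))
```

Larger dilation rates widen the span of input that each output value sees without adding kernel weights. This is one common way to mix local detail with wider context in a single block.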
Pages: 616-632
Number of pages: 17