MetaFormer and CNN Hybrid Model for Polyp Image Segmentation

Authors
Lee, Hyunnam [1]
Yoo, Juhan [2]
Affiliations
[1] Incheon Int Airport Corp, Incheon 22382, South Korea
[2] Semyung Univ, Dept Elect Engn, Jecheon Si 27136, South Korea
Source
IEEE ACCESS | 2024 | Vol. 12
Keywords
Convolutional neural network; image segmentation; medical image processing; MetaFormer; polyp segmentation; vision transformer; VALIDATION
DOI
10.1109/ACCESS.2024.3461754
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Transformer-based methods have become dominant in the medical image research field since the Vision Transformer achieved superior performance. Although transformer-based approaches have resolved long-range dependency problems inherent in Convolutional Neural Network (CNN) methods, they struggle to capture local detail information. Recent research focuses on the robust combination of local detail and semantic information. To address this problem, we propose a novel transformer-CNN hybrid network named RAPUNet. The proposed approach employs MetaFormer as the transformer backbone and introduces a custom convolutional block, RAPU (Residual and Atrous Convolution in Parallel Unit), to enhance local features and alleviate the combination problem of local and global features. We evaluate the segmentation performance of RAPUNet on popular benchmarking datasets for polyp segmentation, including Kvasir-SEG, CVC-ClinicDB, CVC-ColonDB, EndoScene-CVC300, and ETIS-LaribPolypDB. Experimental results show that our model achieves competitive performance in terms of mean Dice and mean IoU. In particular, RAPUNet outperforms state-of-the-art methods on the CVC-ClinicDB dataset. Code is available at https://github.com/hyunnamlee/RAPUNet.
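
The abstract reports results as mean Dice and mean IoU. As a rough illustration of what those two metrics measure, the following is a minimal sketch in plain Python. It is not taken from the RAPUNet repository: the function names `dice_and_iou` and `mean_dice_iou`, the smoothing constant `eps`, and the choice to average per image are all assumptions made for this example, and the paper's actual evaluation protocol may differ.

```python
from typing import Iterable, Sequence, Tuple

Mask = Sequence[Sequence[int]]  # binary mask as rows of 0/1 values


def dice_and_iou(pred: Mask, target: Mask, eps: float = 1e-7) -> Tuple[float, float]:
    """Return (Dice, IoU) for one predicted mask and one ground-truth mask.

    `eps` is a hypothetical smoothing term so that two empty masks score 1.0
    instead of dividing by zero.
    """
    inter = pred_sum = target_sum = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p, t = int(bool(p)), int(bool(t))
            inter += p & t        # pixels marked as polyp in both masks
            pred_sum += p         # pixels predicted as polyp
            target_sum += t       # pixels labelled as polyp
    union = pred_sum + target_sum - inter
    dice = (2 * inter + eps) / (pred_sum + target_sum + eps)
    iou = (inter + eps) / (union + eps)
    return dice, iou


def mean_dice_iou(pairs: Iterable[Tuple[Mask, Mask]]) -> Tuple[float, float]:
    """Average Dice and IoU over (prediction, target) pairs, one pair per image."""
    scores = [dice_and_iou(p, t) for p, t in pairs]
    n = len(scores)
    return sum(d for d, _ in scores) / n, sum(i for _, i in scores) / n


if __name__ == "__main__":
    pred = [[1, 1], [0, 0]]
    target = [[1, 0], [0, 0]]
    print(dice_and_iou(pred, target))
```

Dice weights the overlap twice against the sum of both mask areas, while IoU divides the overlap by the union, so Dice is never lower than IoU for the same pair of masks.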
Pages: 133694-133702
Number of pages: 9