MCNMF-Unet: a mixture Conv-MLP network with multi-scale features fusion Unet for medical image segmentation

被引:3
|
作者
Yuan, Lei [1 ]
Song, Jianhua [1 ]
Fan, Yazhuo [1 ]
机构
[1] Minnan Normal Univ, Sch Phys & Informat Engn, Key Lab Light Field Manipulat & Syst Integrat Appl, Zhangzhou, Fujian, Peoples R China
关键词
Medical image segmentation; Unet; Vision transformer; MLP;
D O I
10.7717/peerj-cs.1798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the medical image segmentation scheme combining Vision Transformer (ViT) and multilayer perceptron (MLP) has been widely used. However, one of its disadvantages is that the feature fusion ability of different levels is weak and lacks flexible localization information. To reduce the semantic gap between the encoding and decoding stages, we propose a mixture conv-MLP network with multi-scale features fusion Unet (MCNMF-Unet) for medical image segmentation. MCNMF-Unet is a U-shaped network based on convolution and MLP, which not only inherits the advantages of convolutional in extracting underlying features and visual structures, but also utilizes MLP to fuse local and global information of each layer of the network. MCNMF-Unet performs multi-layer fusion and multi-scale feature map skip connections in each network stage so that all the feature information can be fully utilized and the gradient disappearance problem can be alleviated. Additionally, MCNMF-Unet incorporates a multi-axis and multi-windows MLP module. This module is fully end-to-end and eliminates the need to consider the negative impact of image cropping. It not only fuses information from multiple dimensions and receptive fields but also reduces the number of parameters and computational complexity. We evaluated the proposed model on BUSI, ISIC2018 and CVC-ClinicDB datasets. The experimental results show that the performance of our proposed model is superior to most existing networks, with an IoU of 84.04% anda F1-score of 91.18%.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
    Amer, Alyaa
    Lambrou, Tryphon
    Ye, Xujiong
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [22] Multi-scale feature pyramid fusion network for medical image segmentation
    Bing Zhang
    Yang Wang
    Caifu Ding
    Ziqing Deng
    Linwei Li
    Zesheng Qin
    Zhao Ding
    Lifeng Bian
    Chen Yang
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 353 - 365
  • [23] Multi-scale feature pyramid fusion network for medical image segmentation
    Zhang, Bing
    Wang, Yang
    Ding, Caifu
    Deng, Ziqing
    Li, Linwei
    Qin, Zesheng
    Ding, Zhao
    Bian, Lifeng
    Yang, Chen
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (02) : 353 - 365
  • [24] Multi-scale nested UNet with transformer for colorectal polyp segmentation
    Wang, Zenan
    Liu, Zhen
    Yu, Jianfeng
    Gao, Yingxin
    Liu, Ming
    [J]. JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (06):
  • [25] UNet segmentation network of COVID-19 CT images with multi-scale attention
    Chen, Mingju
    Yi, Sihang
    Yang, Mei
    Yang, Zhiwen
    Zhang, Xingyue
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 16762 - 16785
  • [26] Light-UNet: An Efficient Segmentation Network for Medical Image
    Zhang, Yue
    Xu, Chao
    Zhang, Zhifan
    Wang, Jianjun
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 302 - 313
  • [27] MR-UNet: An UNet model using multi-scale and residual convolutions for retinal vessel segmentation
    Yang, Xin
    Liu, Li
    Li, Tao
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (05) : 1588 - 1603
  • [28] Image defogging based on multi-input and multi-scale UNet
    Zhengchun Lin
    Qingxing Luo
    Yunzhi Jiang
    Jing Wang
    Siyuan Li
    Gongwen Cheng
    Zheng Genrang
    [J]. Signal, Image and Video Processing, 2023, 17 : 1143 - 1151
  • [29] EMED-UNet: An Efficient Multi-Encoder-Decoder Based UNet for Medical Image Segmentation
    Shah, Kashish D.
    Patel, Dhaval K.
    Thaker, Minesh P.
    Patel, Harsh A.
    Saikia, Manob Jyoti
    Ranger, Bryan J.
    [J]. IEEE ACCESS, 2023, 11 : 95253 - 95266
  • [30] Image defogging based on multi-input and multi-scale UNet
    Lin, Zhengchun
    Luo, Qingxing
    Jiang, Yunzhi
    Wang, Jing
    Li, Siyuan
    Cheng, Gongwen
    Genrang, Zheng
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1143 - 1151