MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation

Authors
Liang Xu [1]
Mingxiao Chen [2]
Yi Cheng [3]
Pengwu Song [3]
Pengfei Shao [1]
Shuwei Shen [2]
Peng Yao [3]
Ronald X. Xu [1]
Affiliations
[1] University of Science and Technology of China, School of Biomedical Engineering, Division of Life Sciences and Medicine
[2] Suzhou Institute for Advanced Research, Department of Precision Machinery and Precision Instrument
[3] University of Science and Technology of China, School of Microelectronics
Keywords
Medical image; Segmentation; Multi-scale; Cross perceptron; Progressive dual-branch structure
DOI
10.1007/s40747-024-01671-1
Abstract
The UNet architecture, based on convolutional neural networks (CNNs), has demonstrated remarkable performance in medical image analysis. However, it struggles to capture long-range dependencies because of the limited receptive fields and inherent bias of convolutional operations. Recently, numerous Transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by capturing global feature correlations. However, integrating Transformer modules may cause local contextual information to be lost during global feature fusion. In this work, we propose a 2D medical image segmentation model called the multi-scale cross perceptron attention network (MCPA). MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron. The Cross Perceptron first captures local correlations using multiple Multi-scale Cross Perceptron modules, which facilitates the fusion of features across scales. The resulting multi-scale feature vectors are then spatially unfolded, concatenated, and fed through a Global Perceptron module to model global dependencies. Considering the high computational cost of 3D neural network models, and the fact that much important clinical data can only be obtained in two dimensions, MCPA focuses on 2D medical image segmentation. Furthermore, we introduce a progressive dual-branch structure (PDBS) to address the semantic segmentation of images involving finer tissue structures. This structure gradually shifts the segmentation focus of MCPA training from large-scale structural features to more sophisticated pixel-level features.
We evaluate the proposed MCPA model on several publicly available medical image datasets from different tasks and devices, including open large-scale CT (Synapse) and MRI (ACDC) datasets, as well as widely used 2D medical imaging datasets captured by fundus camera (DRIVE, CHASE_DB1, HRF) and OCTA (ROSE). The experimental results show that our MCPA model achieves state-of-the-art performance.
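The "unfold, concatenate, then model global dependencies" step described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not specify the internals of the Global Perceptron, so this example substitutes plain single-head scaled dot-product self-attention as a stand-in; the function names, feature-map sizes, and random projection matrices are all hypothetical and are not taken from the paper.

```python
import numpy as np


def unfold_to_tokens(feature_map):
    """Flatten a (C, H, W) feature map into (H*W, C) tokens, one per spatial position."""
    c, h, w = feature_map.shape
    return feature_map.reshape(c, h * w).T


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def global_attention(tokens, w_q, w_k, w_v):
    """Generic self-attention over all tokens (a stand-in, not the paper's module)."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


rng = np.random.default_rng(0)
channels = 8  # assumed shared channel width across scales

# Hypothetical multi-scale feature maps at three spatial resolutions.
scales = [rng.standard_normal((channels, s, s)) for s in (8, 4, 2)]

# Spatially unfold each scale and concatenate along the token axis.
tokens = np.concatenate([unfold_to_tokens(f) for f in scales], axis=0)

# Random projections stand in for learned weights.
w_q, w_k, w_v = (
    rng.standard_normal((channels, channels)) / np.sqrt(channels) for _ in range(3)
)

fused, weights = global_attention(tokens, w_q, w_k, w_v)
print(tokens.shape, fused.shape)
```

Because every token attends to every other token, a position in the coarsest map can draw on positions in the finest map, which is one way cross-scale global dependencies could be modeled under these assumptions.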