RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation

Cited by: 0
Authors
Tang, Hao [1]
Huang, Guoheng [1]
Cheng, Lianglun [1]
Yuan, Xiaochen [2]
Tao, Qi [3]
Chen, Xuhang [4]
Zhong, Guo [5]
Yang, Xiaohui [6]
Affiliations
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
[3] Guangdong Technion Israel Inst Technol, Dept Mech Engn Robot, Shantou 515063, Peoples R China
[4] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Peoples R China
[5] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510006, Peoples R China
[6] Sun Yat sen Univ, Affiliated Hosp 3, Dept Gynecol, Guangzhou, Peoples R China
Keywords
U-Net; State Space Models; Medical image segmentation; Mamba; LSIL; U-NET ARCHITECTURE;
DOI
10.1007/s11760-024-03484-8
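Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;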
Abstract
Accurate segmentation of tissues and lesions is crucial for disease diagnosis, treatment planning, and surgical navigation. Yet the complexity of medical images presents significant challenges for traditional Convolutional Neural Networks and Transformer models, owing to their limited receptive fields or high computational complexity, respectively. State Space Models (SSMs), particularly Mamba and its variants, have recently shown notable performance on vision tasks. However, their feature extraction may not be sufficiently effective, and they retain some redundant structures, leaving room for parameter reduction. In response to these challenges, we introduce Rotational Mamba-UNet (RM-UNet), characterized by a Residual Visual State Space (ResVSS) block and a Rotational SSM Module. The ResVSS block is designed to mitigate network degradation caused by the diminishing efficacy of information transfer from shallower to deeper layers. The Rotational SSM Module is designed to address the difficulty of extracting channel features within State Space Models. Finally, we propose a weighted multi-level loss function that uses the outputs of all three decoder stages for supervision. We conducted experiments on the ISIC17, ISIC18, CVC-300, Kvasir-SEG, CVC-ColonDB, and Kvasir-Instrument datasets, as well as a Low-grade Squamous Intraepithelial Lesion (LSIL) dataset provided by The Third Affiliated Hospital of Sun Yat-sen University, demonstrating the superior segmentation performance of the proposed RM-UNet. Additionally, compared to the previous VM-UNet, our model achieves a one-third reduction in parameters. Our code is available at https://github.com/Halo2Tang/RM-UNet.
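The abstract does not state how the weighted multi-level loss is defined. The following Python sketch shows only the general idea of supervising several decoder stages with a weighted sum. Everything specific in it is an assumption made for illustration, not the authors' formulation: the function names, the choice of binary cross-entropy plus Dice as the per-stage loss, the weights (0.2, 0.3, 0.5), and the premise that each stage output has already been resized to the mask resolution and passed through a sigmoid.

```python
import numpy as np


def bce_loss(prob, target, eps=1e-7):
    """Mean binary cross-entropy between probabilities and a 0/1 mask."""
    prob = np.clip(prob, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.mean(target * np.log(prob)
                          + (1.0 - target) * np.log(1.0 - prob)))


def dice_loss(prob, target, eps=1e-7):
    """Soft Dice loss: 1 - 2|P*T| / (|P| + |T|)."""
    intersection = np.sum(prob * target)
    denom = np.sum(prob) + np.sum(target)
    return float(1.0 - (2.0 * intersection + eps) / (denom + eps))


def weighted_multilevel_loss(stage_probs, target, weights=(0.2, 0.3, 0.5)):
    """Weighted sum of per-stage losses over the decoder outputs.

    stage_probs: list of probability maps, one per decoder stage, each
                 assumed to be already resized to the shape of `target`.
    weights:     hypothetical per-stage weights (not taken from the paper).
    """
    if len(stage_probs) != len(weights):
        raise ValueError("need exactly one weight per decoder stage")
    total = 0.0
    for weight, prob in zip(weights, stage_probs):
        total += weight * (bce_loss(prob, target) + dice_loss(prob, target))
    return total


# Usage: three stage outputs supervised against one ground-truth mask.
mask = np.array([[1.0, 0.0], [0.0, 1.0]])
stages = [np.full_like(mask, 0.5), np.full_like(mask, 0.5), mask.copy()]
loss = weighted_multilevel_loss(stages, mask)
```

In this sketch the last (highest-resolution) stage receives the largest weight, which is a common choice in deep supervision; the actual weighting used in RM-UNet should be taken from the paper or the linked repository.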
Pages: 8427 - 8443
Page count: 17
Related papers
50 in total
  • [1] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [2] Vision Mamba and xLSTM-UNet for medical image segmentation
    Xin Zhong
    Gehao Lu
    Hao Li
    Scientific Reports, 15 (1)
  • [3] UTR: A UNet-like transformer for efficient unsupervised medical image registration
    Qiu, Wei
    Xiong, Lianjin
    Li, Ning
    Wang, Yaobin
    Zhang, Yangsong
    IMAGE AND VISION COMPUTING, 2024, 150
  • [4] Multi-scale context UNet-like network with redesigned skip connections for medical image segmentation
    Qian, Ledan
    Wen, Caiyun
    Li, Yi
    Hu, Zhongyi
    Zhou, Xiao
    Xia, Xiaonyu
    Kim, Soo-Hyung
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [5] PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
    Chen, Danny
    Yang, Wenzhong
    Wang, Liejun
    Tan, Sixiang
    Lin, Jiangzhaung
    Bu, Wenxiu
    PLOS ONE, 2022, 17 (01)
  • [6] DRD-UNet, a UNet-Like Architecture for Multi-Class Breast Cancer Semantic Segmentation
    Ortega-Ruiz, Mauricio Alberto
    Karabag, Cefa
    Roman-Rangel, Edgar
    Reyes-Aldasoro, Constantino Carlos
    IEEE ACCESS, 2024, 12 : 40412 - 40424
  • [7] AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
    Yan, Xiangyi
    Tang, Hao
    Sun, Shanlin
    Ma, Haoyu
    Kong, Deying
    Xie, Xiaohui
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022 : 3270 - 3280
  • [8] Twin-stage Unet-like network for single image deraining
    Zhou, Weina
    Wang, Xiu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) : 1285 - 1293
  • [9] A Novel Elastomeric UNet for Medical Image Segmentation
    Cai, Sijing
    Wu, Yi
    Chen, Guannan
    FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
  • [10] Semi-Mamba-UNet: Pixel-level contrastive and cross-supervised visual Mamba-based UNet for semi-supervised medical image segmentation
    Ma, Chao
    Wang, Ziyang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300