Vision Mamba and xLSTM-UNet for medical image segmentation

Authors
Xin Zhong [1]
Gehao Lu [1]
Hao Li [1]
Affiliations
[1] Yunnan University, School of Information Science and Engineering
Keywords
Deep Learning; Medical Image Segmentation; SSM; xLSTM
DOI
10.1038/s41598-025-88967-5
Abstract
Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. Traditional CNNs are limited by their receptive field, making it challenging to capture long-range dependencies. While Transformers excel at modeling global information, their high computational complexity restricts their practical application in clinical scenarios. To address these limitations, this study introduces VMAXL-UNet, a novel segmentation network that integrates Structured State Space Models (SSM) and lightweight LSTMs (xLSTM). The network incorporates Visual State Space (VSS) and ViL modules in the encoder to efficiently fuse local boundary details with global semantic context. The VSS module leverages SSM to capture long-range dependencies and extract critical features from distant regions. Meanwhile, the ViL module employs a gating mechanism to enhance the integration of local and global features, thereby improving segmentation accuracy and robustness. Experiments on datasets such as ISIC17, ISIC18, CVC-ClinicDB, and Kvasir demonstrate that VMAXL-UNet significantly outperforms traditional CNNs and Transformer-based models in capturing lesion boundaries and their distant correlations. These results highlight the model’s superior performance and provide a promising approach for efficient segmentation in complex medical imaging scenarios.
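As a rough illustration of how a state space model can carry information across long distances, the sketch below runs a scalar discrete linear recurrence over a 1-D sequence. It is not the VSS module from the paper: the function name `ssm_scan` and the parameters `a`, `b`, and `c` are hypothetical, chosen only to show the recurrence h_t = a * h_{t-1} + b * x_t, y_t = c * h_t.

```python
def ssm_scan(xs, a=0.9, b=1.0, c=1.0):
    """Run a scalar discrete state-space recurrence over a sequence.

    h_t = a * h_{t-1} + b * x_t   (state update)
    y_t = c * h_t                 (readout)

    a, b, c are illustrative constants, not values from the paper.
    """
    h = 0.0
    ys = []
    for x in xs:
        h = a * h + b * x  # the state accumulates a decaying memory of past inputs
        ys.append(c * h)
    return ys


# A single impulse at position 0 followed by zeros.
impulse = [1.0] + [0.0] * 9
response = ssm_scan(impulse)

# The input at position 0 still contributes at position 9,
# scaled by a**9, at a cost linear in the sequence length.
print(response[0], response[9])
```

Each step touches only the previous state, so the cost grows linearly with sequence length, which is the general property that makes SSM-style models attractive compared with the quadratic cost of full self-attention.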
Related papers
50 in total
  • [1] RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation
    Tang, Hao
    Huang, Guoheng
    Cheng, Lianglun
    Yuan, Xiaochen
    Tao, Qi
    Chen, Xuhang
    Zhong, Guo
    Yang, Xiaohui
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, : 8427 - 8443
  • [2] Semi-Mamba-UNet: Pixel-level contrastive and cross-supervised visual Mamba-based UNet for semi-supervised medical image segmentation
    Ma, Chao
    Wang, Ziyang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [3] VIG-UNET: VISION GRAPH NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
    Jiang, Juntao
    Chen, Xiyu
    Tian, Guanzhong
    Liu, Yong
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [4] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [5] A Novel Elastomeric UNet for Medical Image Segmentation
    Cai, Sijing
    Wu, Yi
    Chen, Guannan
    FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
  • [6] Improved UNet with Attention for Medical Image Segmentation
    AL Qurri, Ahmed
    Almekkawy, Mohamed
    SENSORS, 2023, 23 (20)
  • [7] AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
    Yan, Xiangyi
    Tang, Hao
    Sun, Shanlin
    Ma, Haoyu
    Kong, Deying
    Xie, Xiaohui
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3270 - 3280
  • [9] Cascade Residual Multiscale Convolution and Mamba-Structured UNet for Advanced Brain Tumor Image Segmentation
    Zhou, Rui
    Wang, Ju
    Xia, Guijiang
    Xing, Jingyang
    Shen, Hongming
    Shen, Xiaoyan
    ENTROPY, 2024, 26 (05)
  • [10] UNET 3+: A FULL-SCALE CONNECTED UNET FOR MEDICAL IMAGE SEGMENTATION
    Huang, Huimin
    Lin, Lanfen
    Tong, Ruofeng
    Hu, Hongjie
    Zhang, Qiaowei
    Iwamoto, Yutaro
    Han, Xianhua
    Chen, Yen-Wei
    Wu, Jian
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1055 - 1059