Vision Mamba and xLSTM-UNet for medical image segmentation

被引：0

作者：

Xin Zhong ^{[1
]}

Gehao Lu ^{[1
]}

Hao Li ^{[1
]}

机构：

[1] Yunnan University,School of Information Science and Engineering

来源：

Scientific Reports | / 15卷 / 1期

关键词：

Deep Learning; Medical Image Segmentation; SSM; XLSTM;

D O I：

10.1038/s41598-025-88967-5

中图分类号：

学科分类号：

摘要：

Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. Traditional CNNs are limited by their receptive field, making it challenging to capture long-range dependencies. While Transformers excel at modeling global information, their high computational complexity restricts their practical application in clinical scenarios. To address these limitations, this study introduces VMAXL-UNet, a novel segmentation network that integrates Structured State Space Models (SSM) and lightweight LSTMs (xLSTM). The network incorporates Visual State Space (VSS) and ViL modules in the encoder to efficiently fuse local boundary details with global semantic context. The VSS module leverages SSM to capture long-range dependencies and extract critical features from distant regions. Meanwhile, the ViL module employs a gating mechanism to enhance the integration of local and global features, thereby improving segmentation accuracy and robustness. Experiments on datasets such as ISIC17, ISIC18, CVC-ClinicDB, and Kvasir demonstrate that VMAXL-UNet significantly outperforms traditional CNNs and Transformer-based models in capturing lesion boundaries and their distant correlations. These results highlight the model’s superior performance and provide a promising approach for efficient segmentation in complex medical imaging scenarios.

引用

共 50 条

[1] RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation
Tang, Hao
Huang, Guoheng
Cheng, Lianglun
Yuan, Xiaochen
Tao, Qi
Chen, Xuhang
Zhong, Guo
Yang, Xiaohui
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, : 8427 - 8443
[2] Semi-Mamba-UNet: Pixel-level contrastive and cross-supervised visual Mamba-based UNet for semi-supervised medical image segmentation
Ma, Chao
Wang, Ziyang
KNOWLEDGE-BASED SYSTEMS, 2024, 300
[3] VIG-UNET: VISION GRAPH NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
Jiang, Juntao
Chen, Xiyu
Tian, Guanzhong
Liu, Yong
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[4] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
Feng, Xiaomeng
Wang, Taiping
Yang, Xiaohang
Zhang, Minfei
Guo, Wanpeng
Wang, Weina
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
[5] A Novel Elastomeric UNet for Medical Image Segmentation
Cai, Sijing
Wu, Yi
Chen, Guannan
FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
[6] Improved UNet with Attention for Medical Image Segmentation
AL Qurri, Ahmed
Almekkawy, Mohamed
SENSORS, 2023, 23 (20)
[7] AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
Yan, Xiangyi
Tang, Hao
Sun, Shanlin
Ma, Haoyu
Kong, Deying
Xie, Xiaohui
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3270 - 3280
[8] Selective and multi-scale fusion Mamba for medical image segmentation
Huang, Qinghua (qhhuang@nwpu.edu.cn), 2025, 261
[9] Cascade Residual Multiscale Convolution and Mamba-Structured UNet for Advanced Brain Tumor Image Segmentation
Zhou, Rui
Wang, Ju
Xia, Guijiang
Xing, Jingyang
Shen, Hongming
Shen, Xiaoyan
ENTROPY, 2024, 26 (05)
[10] UNET 3+: A FULL-SCALE CONNECTED UNET FOR MEDICAL IMAGE SEGMENTATION
Huang, Huimin
Lin, Lanfen
Tong, Ruofeng
Hu, Hongjie
Zhang, Qiaowei
Iwamoto, Yutaro
Han, Xianhua
Chen, Yen-Wei
Wu, Jian
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1055 - 1059

← 1 2 3 4 5 →