LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

被引：80

作者：

Xu, Guoping ^{[1
]}

Zhang, Xuan ^{[1
]}

He, Xinwei ^{[2
]}

Wu, Xinglong ^{[1
]}

机构：

[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Hubei Key Lab Intelligent Robot, Wuhan 430205, Hubei, Peoples R China

[2] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Hubei, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII | 2024年 / 14432卷

关键词：

Medical Image Segmentation; Transformer; Convolutional Neural Network;

D O I：

10.1007/978-981-99-8543-2_4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Medical image segmentation plays an essential role in developing computer-assisted diagnosis and treatment systems, yet it still faces numerous challenges. In the past few years, Convolutional Neural Networks (CNNs) have been successfully applied to the task of medical image segmentation. Regrettably, due to the locality of convolution operations, these CNN-based architectures have their limitations in learning global context information in images, which might be crucial to the success of medical image segmentation. Meanwhile, the vision Transformer (ViT) architectures own the remarkable ability to extract long-range semantic features with the shortcoming of their computation complexity. To make medical image segmentation more efficient and accurate, we present a novel light-weight architecture named LeViT-UNet, which integrates multi-stage Transformer blocks in the encoder via LeViT, aiming to explore the effectiveness of fusion between local and global features together. Our experiments on two challenging segmentation benchmarks indicate that the proposed LeViT-UNet achieved competitive performance compared with various state-of-the-art methods in terms of efficiency and accuracy, suggesting that LeViT can be a faster feature encoder for medical images segmentation. LeViT-UNet-384, for instance, achieves Dice similarity coefficient (DSC) of 78.53% and 90.32% with a segmentation speed of 85 frames per second (FPS) in the Synapse and ACDC datasets, respectively. Therefore, the proposed architecture could be beneficial for prospective clinic trials conducted by the radiologists. Our source codes are publicly available at https://github.com/apple1986/LeViT_UNet.

引用

页码：42 / 53

页数：12

共 50 条

[11] A Novel Elastomeric UNet for Medical Image Segmentation
Cai, Sijing
Wu, Yi
Chen, Guannan
FRONTIERS IN AGING NEUROSCIENCE, 2022, 14
[12] Improved UNet with Attention for Medical Image Segmentation
AL Qurri, Ahmed
Almekkawy, Mohamed
SENSORS, 2023, 23 (20)
[13] LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation
Lin, Qiqin
Yao, Junfeng
Hong, Qingqi
Cao, Xianpeng
Zhou, Rongzhou
Xie, Weixing
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 223 - 234
[14] TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation
Song, Pengfei
Li, Jinjiang
Fan, Hui
Fan, Linwei
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
[15] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
Chen, Shaolong
Qiu, Changzhen
Yang, Weiping
Zhang, Zhiyong
SENSORS, 2022, 22 (10)
[16] RT-Unet: An advanced network based on residual network and transformer for medical image segmentation
Li, Bo
Liu, Sikai
Wu, Fei
Li, GuangHui
Zhong, Meiling
Guan, Xiaohui
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8565 - 8582
[17] AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation
Meng, Wenjie
Liu, Shujun
Wang, Huajun
Biomedical Signal Processing and Control, 2025, 99
[18] H2MaT-Unet:Hierarchical hybrid multi-axis transformer based Unet for medical image segmentation
Ju Z.
Zhou Z.
Qi Z.
Yi C.
Computers in Biology and Medicine, 2024, 174
[19] ResTrans-Unet: A Residual-Aware Transformer-Based Approach to Medical Image Segmentation
Ma, Fengying
Wang, Zhi
Ji, Peng
Fu, Chengcai
Wang, Feng
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (04)
[20] seUNet-Trans: A Simple Yet Effective UNet-Transformer Model for Medical Image Segmentation
Pham, Tan-Hanh
Li, Xianqi
Nguyen, Kim-Doang
IEEE ACCESS, 2024, 12 : 122139 - 122154

← 1 2 3 4 5 →