Multi-TranResUnet: An Improved Transformer Network for Solving Multi-Scale Issues in Image Segmentation

被引:0
|
作者
Kang, Yajing [1 ]
Cheng, Shuai [1 ]
Guo, Liang [2 ]
Zheng, Chao [1 ]
Zhao, Jizhuang [1 ]
机构
[1] China Telecom Res Inst, Beijing 102209, Peoples R China
[2] CAICT, Inst Cloud Comp & Big Data, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Feature extraction; Medical diagnostic imaging; Convolutional neural networks; Accuracy; Computational modeling; Low latency communication; Medical image segmentation; deep learning; transformer; low-latency model;
D O I
10.1109/ACCESS.2024.3457823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-driven medical image segmentation marks a significant milestone in the evolution of intelligent healthcare systems. Despite remarkable accuracy achievements, real-world clinical applications still grapple with complex challenges, particularly in handling multi-scale medical targets. This paper introduces a novel and efficient medical image segmentation network that leverages Transformer technology. The proposed network utilizes the Transformer's global feature extraction capabilities, enriched with spatial context, to substantially elevate segmentation accuracy. Additionally, the fusion encoder we build by combining Transformer modules and Convolutional structures through feature fusion strategies can improve feature extraction capabilities. Acknowledging the computational demands of Transformer models in practical scenarios, we have meticulously optimized our Transformer architecture. This optimization focuses on reducing parameter complexity and inference latency, tailoring the model to address the typical sample scarcity in medical applications. We evaluated our model on two different medical datasets: the 2018 Lesion Boundary Segmentation Challenge, the 2018 Data Science Bowl Challenge and the Kvasir-Instrument dataset. Our model demonstrates state-of-the-art performance in both Dice and MIoU metrics, while maintaining robust real-time processing capabilities. Our code will be released at https://github.com/migouKang/Multi-TranResUnet.
引用
收藏
页码:129000 / 129011
页数:12
相关论文
共 50 条
  • [1] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
    Gai, Di
    Geng, Yuhan
    Huang, Xia
    Huang, Zheng
    Xiong, Xin
    Zhou, Ruihua
    Wang, Qi
    IET IMAGE PROCESSING, 2024, 18 (11) : 3082 - 3092
  • [2] MulTNet: A Multi-Scale Transformer Network for Marine Image Segmentation toward Fishing
    Xu, Xi
    Qin, Yi
    Xi, Dejun
    Ming, Ruotong
    Xia, Jie
    SENSORS, 2022, 22 (19)
  • [3] MULTI-SCALE CONVOLUTION-TRANSFORMER FUSION NETWORK FOR ENDOSCOPIC IMAGE SEGMENTATION
    Zou, Baosheng
    Zhou, Zongguang
    Han, Ying
    Li, Kang
    Wang, Guotai
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [4] An improved multi-scale feature extraction network for medical image segmentation
    Guo, Haoyu
    Shi, Liuliu
    Liu, Jinlong
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 8331 - 8346
  • [5] Multi-Scale Transformer Network for Hyperspectral Image Denoising
    Hu, Shuai
    Hu, Yikun
    Lin, Junyan
    Gao, Feng
    Dong, Junyu
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7328 - 7331
  • [6] Grouped multi-scale vision transformer for medical image segmentation
    Zexuan Ji
    Zheng Chen
    Xiao Ma
    Scientific Reports, 15 (1)
  • [7] Remote sensing image instance segmentation network with transformer and multi-scale feature representation
    Ye, Wenhui
    Zhang, Wei
    Lei, Weimin
    Zhang, Wenchao
    Chen, Xinyi
    Wang, Yanwen
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 234
  • [8] An effective multi-scale interactive fusion network with hybrid Transformer and CNN for smoke image segmentation
    Li, Kang
    Yuan, Feiniu
    Wang, Chunmei
    PATTERN RECOGNITION, 2025, 159
  • [9] MSGAT: Multi-scale gated axial reverse attention transformer network for medical image segmentation
    Liu, Yanjun
    Yun, Haijiao
    Xia, Yang
    Luan, Jinyang
    Li, Mingjing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
  • [10] Multi-scale Channel Transformer Network for Single Image Deraining
    Namba, Yuto
    Han, Xian-Hua
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,