Multi-TranResUnet: An Improved Transformer Network for Solving Multi-Scale Issues in Image Segmentation

被引:0
|
作者
Kang, Yajing [1 ]
Cheng, Shuai [1 ]
Guo, Liang [2 ]
Zheng, Chao [1 ]
Zhao, Jizhuang [1 ]
机构
[1] China Telecom Res Inst, Beijing 102209, Peoples R China
[2] CAICT, Inst Cloud Comp & Big Data, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Feature extraction; Medical diagnostic imaging; Convolutional neural networks; Accuracy; Computational modeling; Low latency communication; Medical image segmentation; deep learning; transformer; low-latency model;
D O I
10.1109/ACCESS.2024.3457823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-driven medical image segmentation marks a significant milestone in the evolution of intelligent healthcare systems. Despite remarkable accuracy achievements, real-world clinical applications still grapple with complex challenges, particularly in handling multi-scale medical targets. This paper introduces a novel and efficient medical image segmentation network that leverages Transformer technology. The proposed network utilizes the Transformer's global feature extraction capabilities, enriched with spatial context, to substantially elevate segmentation accuracy. Additionally, the fusion encoder we build by combining Transformer modules and Convolutional structures through feature fusion strategies can improve feature extraction capabilities. Acknowledging the computational demands of Transformer models in practical scenarios, we have meticulously optimized our Transformer architecture. This optimization focuses on reducing parameter complexity and inference latency, tailoring the model to address the typical sample scarcity in medical applications. We evaluated our model on two different medical datasets: the 2018 Lesion Boundary Segmentation Challenge, the 2018 Data Science Bowl Challenge and the Kvasir-Instrument dataset. Our model demonstrates state-of-the-art performance in both Dice and MIoU metrics, while maintaining robust real-time processing capabilities. Our code will be released at https://github.com/migouKang/Multi-TranResUnet.
引用
收藏
页码:129000 / 129011
页数:12
相关论文
共 50 条
  • [21] Transformer-based Multi-scale Underwater Image Enhancement Network
    Yang, Ai-Ping
    Fang, Si-Jie
    Shao, Ming-Fu
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (12): : 1696 - 1705
  • [22] Multi-scale image semantic segmentation based on ASPP and improved HRNet
    Shi Jian-feng
    Gao Zhi-ming
    Wang A-chuan
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (11) : 1497 - 1505
  • [23] Multi-Scale and Multi-Branch Convolutional Neural Network for Retinal Image Segmentation
    Jiang, Yun
    Liu, Wenhuan
    Wu, Chao
    Yao, Huixiao
    SYMMETRY-BASEL, 2021, 13 (03): : 1 - 25
  • [24] MUSIQ: Multi-scale Image Quality Transformer
    Ke, Junjie
    Wang, Qifei
    Wang, Yilin
    Milanfar, Peyman
    Yang, Feng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5128 - 5137
  • [25] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
    Liang, Zhiwei
    Zhao, Kui
    Liang, Gang
    Li, Siyu
    Wu, Yifei
    Zhou, Yiping
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [26] Multi-scale feature pyramid fusion network for medical image segmentation
    Bing Zhang
    Yang Wang
    Caifu Ding
    Ziqing Deng
    Linwei Li
    Zesheng Qin
    Zhao Ding
    Lifeng Bian
    Chen Yang
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 353 - 365
  • [27] Attention based multi-scale nested network for biomedical image segmentation
    Cheng, Dapeng
    Deng, Jia
    Xiao, Jinjie
    Yanyan, Mao
    Kang, Jialong
    Gai, Jiale
    Zhang, Baosheng
    Zhao, Feng
    HELIYON, 2024, 10 (14)
  • [28] MSAANet: Multi-scale Axial Attention Network for medical image segmentation
    Zeng, Hao
    Shan, Xinxin
    Feng, Yu
    Wen, Ying
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2291 - 2296
  • [29] MSDANet: A multi-scale dilation attention network for medical image segmentation
    Zhang, Jinquan
    Luan, Zhuang
    Ni, Lina
    Qi, Liang
    Gong, Xu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [30] Multi-scale feature pyramid fusion network for medical image segmentation
    Zhang, Bing
    Wang, Yang
    Ding, Caifu
    Deng, Ziqing
    Li, Linwei
    Qin, Zesheng
    Ding, Zhao
    Bian, Lifeng
    Yang, Chen
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (02) : 353 - 365