Multi-TranResUnet: An Improved Transformer Network for Solving Multi-Scale Issues in Image Segmentation

被引:0
|
作者
Kang, Yajing [1 ]
Cheng, Shuai [1 ]
Guo, Liang [2 ]
Zheng, Chao [1 ]
Zhao, Jizhuang [1 ]
机构
[1] China Telecom Res Inst, Beijing 102209, Peoples R China
[2] CAICT, Inst Cloud Comp & Big Data, Beijing 100191, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Feature extraction; Medical diagnostic imaging; Convolutional neural networks; Accuracy; Computational modeling; Low latency communication; Medical image segmentation; deep learning; transformer; low-latency model;
D O I
10.1109/ACCESS.2024.3457823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-driven medical image segmentation marks a significant milestone in the evolution of intelligent healthcare systems. Despite remarkable accuracy achievements, real-world clinical applications still grapple with complex challenges, particularly in handling multi-scale medical targets. This paper introduces a novel and efficient medical image segmentation network that leverages Transformer technology. The proposed network utilizes the Transformer's global feature extraction capabilities, enriched with spatial context, to substantially elevate segmentation accuracy. Additionally, the fusion encoder we build by combining Transformer modules and Convolutional structures through feature fusion strategies can improve feature extraction capabilities. Acknowledging the computational demands of Transformer models in practical scenarios, we have meticulously optimized our Transformer architecture. This optimization focuses on reducing parameter complexity and inference latency, tailoring the model to address the typical sample scarcity in medical applications. We evaluated our model on two different medical datasets: the 2018 Lesion Boundary Segmentation Challenge, the 2018 Data Science Bowl Challenge and the Kvasir-Instrument dataset. Our model demonstrates state-of-the-art performance in both Dice and MIoU metrics, while maintaining robust real-time processing capabilities. Our code will be released at https://github.com/migouKang/Multi-TranResUnet.
引用
收藏
页码:129000 / 129011
页数:12
相关论文
共 50 条
  • [31] Lightweight multi-scale dynamic selection network for medical image segmentation
    Dong, Xue-Mei
    Sun, Yu
    Wang, Lili
    INFORMATION SCIENCES, 2024, 677
  • [32] Multi-scale Convolutional Neural Network for SAR Image Semantic Segmentation
    Duan, Yiping
    Tao, Xiaoming
    Han, Chaoyi
    Qin, Xiaowei
    Lu, Jianhua
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [33] Multi-scale morphological simplification for image segmentation
    Lu, GM
    Yang, Z
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 484 - 487
  • [34] Multi-scale image segmentation based on morphology
    Wang, XP
    Hao, CY
    Fan, YY
    Xi, YL
    CHINESE JOURNAL OF ELECTRONICS, 2005, 14 (01): : 119 - 121
  • [35] Multi-scale Image Co-segmentation
    Es-Salhi, Rachida
    Daoudi, Imane
    Weber, Jonathan
    El Ouardi, Hamid
    Tallal, Saida
    Medromi, Hicham
    ADVANCES IN UBIQUITOUS NETWORKING, 2016, 366 : 381 - 390
  • [36] REPRESENTATION OF IMAGE CONTENT WITH MULTI-SCALE SEGMENTATION
    Zhang, Jing
    Zhao, Ya-Xin
    Li, Da
    Chen, Zhi-Hua
    Yuan, Yu-Bo
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1552 - 1555
  • [37] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
    Rahman, Md Mostafijur
    Marculescu, Radu
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
  • [38] SMVT: Spectrum-Driven Multi-scale Vision Transformer for Referring Image Segmentation
    Li, Tianxiao
    Chen, Junhong
    Huang, Yiheng
    Huang, Kesi
    Xia, Qiqiang
    Asim, Muhammad
    Liu, Wenyin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 193 - 206
  • [39] An Improved Transformer Network With Multi-Scale Convolution for Weed Identification in Sugarcane Field
    Sun, Cuimin
    Zhang, Menghua
    Zhou, Muchen
    Zhou, Xingzhi
    IEEE ACCESS, 2024, 12 : 31168 - 31181
  • [40] Multi-scale and multi-patch transformer for sandstorm image enhancement
    Liang, Pengwei
    Ding, Wenyu
    Fan, Lu
    Wang, Haoyu
    Li, Zihong
    Yang, Fan
    Wang, Bo
    Li, Chongyi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89