CasUNeXt: A Cascaded Transformer With Intra- and Inter-Scale Information for Medical Image Segmentation

Cited by: 0
Authors
Sun, Junding [1]
Zheng, Xiaopeng [1]
Wu, Xiaosheng [1]
Tang, Chaosheng [1]
Wang, Shuihua [1, 2, 3]
Zhang, Yudong [1, 2, 4]
Affiliations
[1] Henan Polytech Univ, Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
[2] Univ Leicester, Comp & Math Sci, Leicester, England
[3] Xian Jiaotong Liverpool Univ, Dept Biol Sci, Suzhou, Jiangsu, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah, Saudi Arabia
Funding
National Natural Science Foundation of China
Keywords
cascade; CNN; multi-scale features; transformer; PLUS PLUS;
DOI
10.1002/ima.23184
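Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract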
Because Self-Attention lets the Transformer capture long-range dependencies, it has shown immense potential in medical image segmentation. However, it lacks the capability to model local relationships between pixels, so many previous approaches embed the Transformer into a CNN encoder. These methods often fall short in modeling the relationships between multi-scale features, specifically the spatial correspondence between features at different scales. This limitation can prevent the network from capturing the scale differences of each object and can cause features of small targets to be lost. Furthermore, due to the high complexity of the Transformer, it is challenging to integrate local and global information within the same scale effectively. To address these limitations, we propose a novel backbone network called CasUNeXt, which features three design elements: (1) We use the idea of cascading to redesign how the CNN and the Transformer are combined, enhancing the modeling of interrelationships between multi-scale information. (2) We design a Cascaded Scale-wise Transformer Module capable of cross-scale interactions; it not only strengthens feature extraction within a single scale but also models interactions between different scales. (3) We overhaul the multi-head Channel Attention mechanism so that it models context information in feature maps from multiple perspectives within the channel dimension. Together, these design features enable CasUNeXt to better integrate local and global information and to capture relationships between multi-scale features, thereby improving medical image segmentation performance. In experimental comparisons on various benchmark datasets, CasUNeXt outperforms current state-of-the-art methods in medical image segmentation.
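The abstract does not specify how the multi-head Channel Attention is computed, so the following is only an illustrative sketch of the general idea of attending across channels in several heads. It is not the authors' implementation. The function name `multi_head_channel_attention`, the identity query/key/value projections, and the L2 normalization are all assumptions made for this example.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def multi_head_channel_attention(feat, num_heads):
    """Hypothetical channel attention over a (C, H, W) feature map.

    Channels are split into `num_heads` groups. Inside each group, every
    channel attends to the other channels of that group, so the attention
    matrix has shape (C / num_heads, C / num_heads) rather than (H*W, H*W).
    """
    c, h, w = feat.shape
    if c % num_heads != 0:
        raise ValueError("channel count must be divisible by num_heads")

    # Flatten the spatial dimensions: each channel becomes one token of length H*W.
    tokens = feat.reshape(num_heads, c // num_heads, h * w)

    # Assumption: no learned projections, so Q = K = V = tokens.
    # L2-normalize Q and K so the affinities are cosine similarities.
    norm = np.linalg.norm(tokens, axis=-1, keepdims=True) + 1e-8
    q = tokens / norm
    k = q

    # Channel-to-channel affinity per head: (heads, C/heads, C/heads).
    attn = softmax(q @ k.transpose(0, 2, 1), axis=-1)

    # Re-weight the channel tokens and restore the spatial layout.
    out = (attn @ tokens).reshape(c, h, w)
    return out, attn
```

Under these assumptions the attention cost grows with the number of channels per head instead of the number of pixels, which is one common motivation for channel-wise attention on high-resolution feature maps.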
Pages: 13
Related papers
50 in total
  • [31] IIAM: Intra and Inter Attention With Mutual Consistency Learning Network for Medical Image Segmentation
    Pang, Chen
    Lu, Xuequan
    Liu, Xiang
    Zhang, Renfeng
    Lyu, Lei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) : 5971 - 5983
  • [32] Markov Random Field Based Dynamic Texture Segmentation Using Inter-scale Context
    Chen, Liqiu
    Qiao, Yulong
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1924 - 1927
  • [33] Inter-scale information flow as a surrogate for downward causation that maintains spiral waves
    Ashikaga, Hiroshi
    James, Ryan G.
    CHAOS, 2018, 28 (07)
  • [34] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [35] Medical Image Segmentation Using Transformer Networks
    Karimi, Davood
    Dou, Haoran
    Gholipour, Ali
    IEEE ACCESS, 2022, 10 : 29322 - 29332
  • [36] Intra- and Inter-Head Orthogonal Attention for Image Captioning
    Zhang, Xiaodan
    Jia, Aozhe
    Ji, Junzhong
    Qu, Liangqiong
    Ye, Qixiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 594 - 607
  • [37] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Qinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [38] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658
  • [39] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07)
  • [40] Coformer: Collaborative Transformer for Medical Image Segmentation
    Gao, Yufei
    Zhang, Shichao
    Zhang, Dandan
    Shi, Yucheng
    Zhao, Guohua
    Shi, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 240 - 250