CasUNeXt: A Cascaded Transformer With Intra- and Inter-Scale Information for Medical Image Segmentation

Citations: 0
Authors
Sun, Junding [1]
Zheng, Xiaopeng [1]
Wu, Xiaosheng [1]
Tang, Chaosheng [1]
Wang, Shuihua [1, 2, 3]
Zhang, Yudong [1, 2, 4]
Affiliations
[1] Henan Polytech Univ, Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
[2] Univ Leicester, Comp & Math Sci, Leicester, England
[3] Xian Jiaotong Liverpool Univ, Dept Biol Sci, Suzhou, Jiangsu, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah, Saudi Arabia
Funding
National Natural Science Foundation of China;
Keywords
cascade; CNN; multi-scale features; transformer; PLUS PLUS;
DOI
10.1002/ima.23184
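Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract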
Due to the Transformer's ability to capture long-range dependencies through Self-Attention, it has shown immense potential in medical image segmentation. However, it lacks the capability to model local relationships between pixels. Therefore, many previous approaches embedded the Transformer into the CNN encoder. However, current methods often fall short in modeling the relationships between multi-scale features, specifically the spatial correspondence between features at different scales. This limitation can result in the ineffective capture of scale differences for each object and the loss of features for small targets. Furthermore, due to the high complexity of the Transformer, it is challenging to integrate local and global information within the same scale effectively. To address these limitations, we propose a novel backbone network called CasUNeXt, which features three appealing design elements: (1) We use the idea of cascade to redesign the way CNN and Transformer are combined to enhance modeling the unique interrelationships between multi-scale information. (2) We design a Cascaded Scale-wise Transformer Module capable of cross-scale interactions. It not only strengthens feature extraction within a single scale but also models interactions between different scales. (3) We overhaul the multi-head Channel Attention mechanism to enable it to model context information in feature maps from multiple perspectives within the channel dimension. These design features collectively enable CasUNeXt to better integrate local and global information and capture relationships between multi-scale features, thereby improving the performance of medical image segmentation. Through experimental comparisons on various benchmark datasets, our CasUNeXt method exhibits outstanding performance in medical image segmentation tasks, surpassing the current state-of-the-art methods.
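As a rough illustration of what "multi-head attention within the channel dimension" can mean, the sketch below splits the channels of a feature map into groups (heads) and computes a channel-by-channel attention matrix inside each group. This is not the CasUNeXt module: the function name `multi_head_channel_attention`, the absence of learned query/key/value projections, and the `sqrt(H * W)` scaling are all assumptions made for this example only.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def multi_head_channel_attention(feat, num_heads):
    """Hypothetical sketch: self-attention across channels, per head.

    feat: array of shape (C, H, W); C must be divisible by num_heads.
    Returns the re-weighted feature map (C, H, W) and the attention
    weights of shape (num_heads, C // num_heads, C // num_heads).
    Assumption: queries, keys and values are the raw features; a real
    module would normally apply learned projections first.
    """
    c, h, w = feat.shape
    if c % num_heads != 0:
        raise ValueError("channel count must be divisible by num_heads")
    # Each channel becomes one token whose embedding is its flattened map.
    tokens = feat.reshape(num_heads, c // num_heads, h * w)
    # Channel-to-channel similarity inside each head.
    scores = tokens @ tokens.transpose(0, 2, 1) / np.sqrt(h * w)
    weights = softmax(scores, axis=-1)
    # Every output channel is a weighted mix of the channels in its head.
    out = weights @ tokens
    return out.reshape(c, h, w), weights


rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 4, 4))
out, weights = multi_head_channel_attention(feat, num_heads=2)
```

Because attention is computed over channels rather than pixels, the attention matrix has size (C / heads) x (C / heads) and does not grow with image resolution, which is one common motivation for channel-wise designs.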
Pages: 13