CasUNeXt: A Cascaded Transformer With Intra- and Inter-Scale Information for Medical Image Segmentation

Cited by: 0
Authors
Sun, Junding [1]
Zheng, Xiaopeng [1]
Wu, Xiaosheng [1]
Tang, Chaosheng [1]
Wang, Shuihua [1, 2, 3]
Zhang, Yudong [1, 2, 4]
Affiliations
[1] Henan Polytech Univ, Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
[2] Univ Leicester, Comp & Math Sci, Leicester, England
[3] Xian Jiaotong Liverpool Univ, Dept Biol Sci, Suzhou, Jiangsu, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah, Saudi Arabia
Funding
National Natural Science Foundation of China
Keywords
cascade; CNN; multi-scale features; transformer; PLUS PLUS;
DOI
10.1002/ima.23184
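Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract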
Due to the Transformer's ability to capture long-range dependencies through Self-Attention, it has shown immense potential in medical image segmentation. However, it lacks the capability to model local relationships between pixels. Therefore, many previous approaches embedded the Transformer into the CNN encoder. However, current methods often fall short in modeling the relationships between multi-scale features, specifically the spatial correspondence between features at different scales. This limitation can result in the ineffective capture of scale differences for each object and the loss of features for small targets. Furthermore, due to the high complexity of the Transformer, it is challenging to integrate local and global information within the same scale effectively. To address these limitations, we propose a novel backbone network called CasUNeXt, which features three appealing design elements: (1) We use the idea of cascade to redesign the way CNN and Transformer are combined to enhance modeling the unique interrelationships between multi-scale information. (2) We design a Cascaded Scale-wise Transformer Module capable of cross-scale interactions. It not only strengthens feature extraction within a single scale but also models interactions between different scales. (3) We overhaul the multi-head Channel Attention mechanism to enable it to model context information in feature maps from multiple perspectives within the channel dimension. These design features collectively enable CasUNeXt to better integrate local and global information and capture relationships between multi-scale features, thereby improving the performance of medical image segmentation. Through experimental comparisons on various benchmark datasets, our CasUNeXt method exhibits outstanding performance in medical image segmentation tasks, surpassing the current state-of-the-art methods.
Pages: 13
Related papers
50 records in total
  • [41] CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 843-853
  • [42] Medical Image Segmentation via Cascaded Attention Decoding
    Rahman, Md Mostafijur
    Marculescu, Radu
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023: 6211-6220
  • [43] I2CNet: An Intra- and Inter-Class Context Information Fusion Network for Blastocyst Segmentation
    Wang, Hua
    Qiu, Linwei
    Hu, Jingfei
    Zhang, Jicong
    IJCAI International Joint Conference on Artificial Intelligence, 2022: 1415-1422
  • [44] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [45] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
    Gai, Di
    Geng, Yuhan
    Huang, Xia
    Huang, Zheng
    Xiong, Xin
    Zhou, Ruihua
    Wang, Qi
    IET IMAGE PROCESSING, 2024, 18 (11): 3082-3092
  • [46] LTMSegnet: Lightweight multi-scale medical image segmentation combining Transformer and MLP
    Huang, Xin
    Tang, Hongxiang
    Ding, Yan
    Li, Yuanyuan
    Zhu, Zhiqin
    Yang, Pan
    Computers in Biology and Medicine, 2024, 183
  • [47] Remote sensing image denoising via sparse representation with inter-scale correlation model
    Cui, Zhi
    Cui, Xian-Pu
    International Journal of Earth Sciences and Engineering, 2015, 8 (06): 2809-2816
  • [48] A study of attention information from transformer layers in hybrid medical image segmentation networks
    Hasany, Syed Nouman
    Petitjean, Caroline
    Meriaudeau, Fabrice
    MEDICAL IMAGING 2023, 2023, 12464
  • [49] Infrared image denoising based on improved threshold and inter-scale correlations of wavelet transform
    Yang H.-X.
    Wang X.-S.
    Xie P.-H.
    Leng A.-L.
    Peng Y.
    Zidonghua Xuebao/Acta Automatica Sinica, 2011, 37 (10): 1167-1174
  • [50] HI-Net: Liver vessel segmentation with hierarchical inter-scale multi-scale feature fusion
    Liu, Zhe
    Teng, Qiaoying
    Song, Yuqing
    Hao, Wen
    Liu, Yi
    Zhu, Yan
    Li, Yuefeng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96