CasUNeXt: A Cascaded Transformer With Intra- and Inter-Scale Information for Medical Image Segmentation

被引:0
|
作者
Sun, Junding [1 ]
Zheng, Xiaopeng [1 ]
Wu, Xiaosheng [1 ]
Tang, Chaosheng [1 ]
Wang, Shuihua [1 ,2 ,3 ]
Zhang, Yudong [1 ,2 ,4 ]
机构
[1] Henan Polytech Univ, Comp Sci & Technol, Jiaozuo, Henan, Peoples R China
[2] Univ Leicester, Comp & Math Sci, Leicester, England
[3] Xian Jiaotong Liverpool Univ, Dept Biol Sci, Suzhou, Jiangsu, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah, Saudi Arabia
基金
中国国家自然科学基金;
关键词
cascade; CNN; multi-scale features; transformer; PLUS PLUS;
D O I
10.1002/ima.23184
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the Transformer's ability to capture long-range dependencies through Self-Attention, it has shown immense potential in medical image segmentation. However, it lacks the capability to model local relationships between pixels. Therefore, many previous approaches embedded the Transformer into the CNN encoder. However, current methods often fall short in modeling the relationships between multi-scale features, specifically the spatial correspondence between features at different scales. This limitation can result in the ineffective capture of scale differences for each object and the loss of features for small targets. Furthermore, due to the high complexity of the Transformer, it is challenging to integrate local and global information within the same scale effectively. To address these limitations, we propose a novel backbone network called CasUNeXt, which features three appealing design elements: (1) We use the idea of cascade to redesign the way CNN and Transformer are combined to enhance modeling the unique interrelationships between multi-scale information. (2) We design a Cascaded Scale-wise Transformer Module capable of cross-scale interactions. It not only strengthens feature extraction within a single scale but also models interactions between different scales. (3) We overhaul the multi-head Channel Attention mechanism to enable it to model context information in feature maps from multiple perspectives within the channel dimension. These design features collectively enable CasUNeXt to better integrate local and global information and capture relationships between multi-scale features, thereby improving the performance of medical image segmentation. Through experimental comparisons on various benchmark datasets, our CasUNeXt method exhibits outstanding performance in medical image segmentation tasks, surpassing the current state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Image denoising by anisotropic diffusion with inter-scale information fusion
    Prasath V.B.S.
    Prasath, V. B. Surya (prasaths@missouri.edu), 2017, Izdatel'stvo Nauka (27) : 748 - 753
  • [2] I2-Net: Intra- and Inter-scale Collaborative Learning Network for Abdominal Multi-organ Segmentation
    Suo, Chao
    Li, Xuanya
    Tan, Donghui
    Zhang, Yuan
    Gao, Xieping
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 654 - 660
  • [3] Customized wavelet denoising using intra- and inter-scale dependency for bearing fault detection
    Li Zhen
    He Zhengjia
    Zi Yanyang
    Wang Yanxue
    JOURNAL OF SOUND AND VIBRATION, 2008, 313 (1-2) : 342 - 359
  • [4] TV-based Multi-Scale Super Resolution using Intra- and Inter-Scale Correlations
    Wu, Jiying
    Fu, Jingjing
    Zeng, Bing
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 2646 - 2649
  • [5] Progressive inter-scale and intra-scale non-blind image deconvolution
    Yuan, Lu
    Sun, Jian
    Quan, Long
    Shum, Heung-Yeung
    ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03):
  • [6] Color Image Segmentation Based on Vectorial Multiscale Diffusion with Inter-scale Linking
    Prasath, V. B. Surya
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 339 - 344
  • [7] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
    Rahman, Md Mostafijur
    Marculescu, Radu
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
  • [8] Skin Lesion Segmentation Improved by Transformer-Based Networks with Inter-scale Dependency Modeling
    Eskandari, Sania
    Lumpp, Janet
    Giraldo, Luis Sanchez
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 351 - 360
  • [9] Enhanced transformer encoder and hybrid cascaded upsampler for medical image segmentation
    Li, Chaoqun
    Wang, Liejun
    Cheng, Shuli
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [10] Multi-modal medical image fusion using the inter-scale and intra-scale dependencies between image shift-invariant shearlet coefficients
    Wang, Lei
    Li, Bin
    Tian, Lian-Fang
    INFORMATION FUSION, 2014, 19 : 20 - 28