TransSea: Hybrid CNN-Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation

被引:7
|
作者
Liu, Yu [1 ,2 ]
Ma, Yize [1 ,2 ]
Zhu, Zhiqin [3 ]
Cheng, Juan [1 ,2 ]
Chen, Xun [4 ]
机构
[1] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Anhui Prov Key Lab Measuring Theory & Precis Instr, Hefei 230009, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[4] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Brain tumor segmentation; convolutional neural networks (CNNs); multimodal magnetic resonance imaging (MRI); semantic guidance (SG); Transformer; U-NET; ATTENTION;
D O I
10.1109/TIM.2024.3413130
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate segmentation of brain tumors in multimodal magnetic resonance imaging (MRI) plays a crucial role in clinical quantitative assessments, diagnostic processes, and the planning of therapeutic strategies. Both convolutional neural networks (CNNs) with strong local information extraction capacities and Transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, considering the inherent semantic disparities between local and global features, effectively combining convolutions and Transformers presents a significant challenge in medical image segmentation. To address this issue, through integrating the merits of these two paradigms in a well-designed encoder-decoder architecture, we propose a hybrid CNN-Transformer network with semantic awareness, named TransSea, for an accurate 3-D brain tumor segmentation task. Our network incorporates a semantic mutual attention (SMA) module at the encoding stage, seamlessly integrating global and local features. Furthermore, our design includes a multiscale semantic guidance (SG) module that introduces semantic priors in the encoder through semantic supervision, enabling focused segmentation in relevant areas. In the decoding process, a semantic integration (SI) module is presented to further integrate various feature mappings from the encoder and semantic priors, thereby enhancing the propagation of semantic information and achieving semantically aware querying. Extensive experiments on two brain tumor datasets, BraTS2020 and BraTS2021, demonstrate that our model significantly outperforms existing state-of-the-art methods. The source code of the proposed method will be made available at https://github.com/yuliu316316.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
    Zeeshan Shaukat
    Qurratul Ain Farooq
    Chuangbai Xiao
    Saqib Ali
    Faheem Akhtar
    Muhammad Azeem
    Abdul Ahad Zulfiqar
    Multimedia Tools and Applications, 2024, 83 : 25121 - 25134
  • [32] Combined 3D CNN for Brain Tumor Segmentation
    Ahmad, Parvez
    Jin, Hai
    Qamar, Saqib
    Zheng, Ran
    Jiang, Wenbin
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 113 - 116
  • [33] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
    Shaukat, Zeeshan
    Farooq, Qurratul Ain
    Xiao, Chuangbai
    Ali, Saqib
    Akhtar, Faheem
    Azeem, Muhammad
    Zulfiqar, Abdul Ahad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25121 - 25134
  • [34] ENHANCING HYBRID CNN-TRANSFORMER VIA FREQUENCY-BASED BRIDGING FOR MEDICAL IMAGE SEGMENTATION
    Zeng Xinyi
    Tang Cheng
    Zeng Pinxian
    Cui Jiaqi
    Yan Binyu
    Wang Peng
    Wang Yan
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [35] ACTNet: A Dual-Attention Adapter with a CNN-Transformer Network for the Semantic Segmentation of Remote Sensing Imagery
    Zhang, Zheng
    Liu, Fanchen
    Liu, Changan
    Tian, Qing
    Qu, Hongquan
    REMOTE SENSING, 2023, 15 (09)
  • [36] A Lightweight CNN-Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation
    Lu, Wen
    Zhang, Zhiqi
    Nguyen, Minh
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [37] PFormer: An efficient CNN-Transformer hybrid network with content-driven P-attention for 3D medical image segmentation
    Gao, Yueyang
    Zhang, Jinhui
    Wei, Siyi
    Li, Zheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [38] SwinUNeLCsT: Global-local spatial representation learning with hybrid CNN-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation
    Tan, Zhuoyi
    Madzin, Hizmawati
    Norafida, Bahari
    Rahmat, Rahmita Wirza O. K.
    Khalid, Fatimah
    Sulaiman, Puteri Suhaiza
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
  • [39] SSNet: A Novel Transformer and CNN Hybrid Network for Remote Sensing Semantic Segmentation
    Yao, Min
    Zhang, Yaozu
    Liu, Guofeng
    Pang, Dongdong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3023 - 3037
  • [40] Hybrid CNN and Transformer Network for Semantic Segmentation of UAV Remote Sensing Images
    Zhou X.
    Zhou L.
    Gong S.
    Zhang H.
    Zhong S.
    Xia Y.
    Huang Y.
    IEEE Journal on Miniaturization for Air and Space Systems, 2024, 5 (01): : 33 - 41