TransSea: Hybrid CNN-Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation

被引：7

作者：

Liu, Yu ^{[1
,2
]}

Ma, Yize ^{[1
,2
]}

Zhu, Zhiqin ^{[3
]}

Cheng, Juan ^{[1
,2
]}

Chen, Xun ^{[4
]}

机构：

[1] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China

[2] Hefei Univ Technol, Anhui Prov Key Lab Measuring Theory & Precis Instr, Hefei 230009, Peoples R China

[3] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China

[4] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

基金：

中国国家自然科学基金;

关键词：

Brain tumor segmentation; convolutional neural networks (CNNs); multimodal magnetic resonance imaging (MRI); semantic guidance (SG); Transformer; U-NET; ATTENTION;

D O I：

10.1109/TIM.2024.3413130

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Accurate segmentation of brain tumors in multimodal magnetic resonance imaging (MRI) plays a crucial role in clinical quantitative assessments, diagnostic processes, and the planning of therapeutic strategies. Both convolutional neural networks (CNNs) with strong local information extraction capacities and Transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, considering the inherent semantic disparities between local and global features, effectively combining convolutions and Transformers presents a significant challenge in medical image segmentation. To address this issue, through integrating the merits of these two paradigms in a well-designed encoder-decoder architecture, we propose a hybrid CNN-Transformer network with semantic awareness, named TransSea, for an accurate 3-D brain tumor segmentation task. Our network incorporates a semantic mutual attention (SMA) module at the encoding stage, seamlessly integrating global and local features. Furthermore, our design includes a multiscale semantic guidance (SG) module that introduces semantic priors in the encoder through semantic supervision, enabling focused segmentation in relevant areas. In the decoding process, a semantic integration (SI) module is presented to further integrate various feature mappings from the encoder and semantic priors, thereby enhancing the propagation of semantic information and achieving semantically aware querying. Extensive experiments on two brain tumor datasets, BraTS2020 and BraTS2021, demonstrate that our model significantly outperforms existing state-of-the-art methods. The source code of the proposed method will be made available at https://github.com/yuliu316316.

引用

页数：16

共 50 条

[31] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
Zeeshan Shaukat
Qurratul Ain Farooq
Chuangbai Xiao
Saqib Ali
Faheem Akhtar
Muhammad Azeem
Abdul Ahad Zulfiqar
Multimedia Tools and Applications, 2024, 83 : 25121 - 25134
[32] Combined 3D CNN for Brain Tumor Segmentation
Ahmad, Parvez
Jin, Hai
Qamar, Saqib
Zheng, Ran
Jiang, Wenbin
THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 113 - 116
[33] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
Shaukat, Zeeshan
Farooq, Qurratul Ain
Xiao, Chuangbai
Ali, Saqib
Akhtar, Faheem
Azeem, Muhammad
Zulfiqar, Abdul Ahad
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25121 - 25134
[34] ENHANCING HYBRID CNN-TRANSFORMER VIA FREQUENCY-BASED BRIDGING FOR MEDICAL IMAGE SEGMENTATION
Zeng Xinyi
Tang Cheng
Zeng Pinxian
Cui Jiaqi
Yan Binyu
Wang Peng
Wang Yan
IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
[35] ACTNet: A Dual-Attention Adapter with a CNN-Transformer Network for the Semantic Segmentation of Remote Sensing Imagery
Zhang, Zheng
Liu, Fanchen
Liu, Changan
Tian, Qing
Qu, Hongquan
REMOTE SENSING, 2023, 15 (09)
[36] A Lightweight CNN-Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation
Lu, Wen
Zhang, Zhiqi
Nguyen, Minh
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
[37] PFormer: An efficient CNN-Transformer hybrid network with content-driven P-attention for 3D medical image segmentation
Gao, Yueyang
Zhang, Jinhui
Wei, Siyi
Li, Zheng
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
[38] SwinUNeLCsT: Global-local spatial representation learning with hybrid CNN-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation
Tan, Zhuoyi
Madzin, Hizmawati
Norafida, Bahari
Rahmat, Rahmita Wirza O. K.
Khalid, Fatimah
Sulaiman, Puteri Suhaiza
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (04)
[39] SSNet: A Novel Transformer and CNN Hybrid Network for Remote Sensing Semantic Segmentation
Yao, Min
Zhang, Yaozu
Liu, Guofeng
Pang, Dongdong
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3023 - 3037
[40] Hybrid CNN and Transformer Network for Semantic Segmentation of UAV Remote Sensing Images
Zhou X.
Zhou L.
Gong S.
Zhang H.
Zhong S.
Xia Y.
Huang Y.
IEEE Journal on Miniaturization for Air and Space Systems, 2024, 5 (01): : 33 - 41

← 1 2 3 4 5 →