SwinUNeLCsT: Global-local spatial representation learning with hybrid CNN-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation

Cited by: 4
Authors
Tan, Zhuoyi [1]
Madzin, Hizmawati [1]
Norafida, Bahari [2]
Rahmat, Rahmita Wirza O. K. [1]
Khalid, Fatimah [1]
Sulaiman, Puteri Suhaiza
Affiliations
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Serdang 43400, Malaysia
[2] Dept Radiol, Univ Putra Malaysia, Serdang 43400, Selangor, Malaysia
Keywords
Deep learning; Classification; Semantic segmentation; Weakly-supervised learning; CT tuberculosis imaging; IMAGE
DOI
10.1016/j.jksuci.2024.102012
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Radiological diagnosis of lung cavities (LCs) is the key to identifying tuberculosis (TB). Conventional deep learning methods rely on a large amount of accurate pixel-level data to segment LCs. This process is time-consuming and laborious, especially for those subtle LCs. To address such challenges, firstly, we introduce a novel 3D TB LCs imaging convolutional neural network (CNN)-transformer hybrid model (SwinUNeLCsT). The core idea of SwinUNeLCsT is to combine local details and global dependencies for TB CT scan image feature representation to effectively improve the recognition ability of LCs. Secondly, to reduce the dependence on accurate pixel-level annotations, we design an end-to-end LCs weakly supervised semantic segmentation (WSSS) framework. Through this framework, radiologists need only to classify the number and the approximate location (e.g., left lung, right lung, or both) of LCs in the CT scan to achieve efficient segmentation of the LCs. This process eliminates the need for meticulously drawing boundaries, greatly reducing the cost of annotation. Extensive experimental results show that SwinUNeLCsT outperforms currently popular medical 3D segmentation methods in the supervised semantic segmentation paradigm. Meanwhile, our WSSS framework based on SwinUNeLCsT also performs best among the existing state-of-the-art medical 3D WSSS methods.
Pages: 15
Related papers
9 in total
  • [1] Leveraging Swin Transformer for Local-to-Global Weakly Supervised Semantic Segmentation
    Ahmadi, Rozhan
    Kasaei, Shohreh
    PROCEEDINGS OF THE 13TH IRANIAN/3RD INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, MVIP, 2024, : 117 - 123
  • [2] A hybrid CNN-transformer network: Accurate and efficient semantic segmentation of crops and weeds on resource-constrained embedded devices
    Wei, Yifan
    Feng, Yuncong
    Zu, Dongcheng
    Zhang, Xiaoli
    CROP PROTECTION, 2025, 188
  • [3] UNesT: Local Spatial Representation Learning with Hierarchical Transformer for Efficient Medical Segmentation
    Yu, Xin
    Yang, Qi
    Zhou, Yinchi
    Cai, Leon Y.
    Gao, Riqiang
    Lee, Ho Hin
    Li, Thomas
    Bao, Shunxing
    Xu, Zhoubing
    Lasko, Thomas A.
    Abramson, Richard G.
    Zhang, Zizhao
    Huo, Yuankai
    Landman, Bennett A.
    Tang, Yucheng
    arXiv, 2022,
  • [4] UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation
    Yu, Xin
    Yang, Qi
    Zhou, Yinchi
    Cai, Leon Y.
    Gao, Riqiang
    Lee, Ho Hin
    Li, Thomas
    Bao, Shunxing
    Xu, Zhoubing
    Lasko, Thomas A.
    Abramson, Richard G.
    Zhang, Zizhao
    Huo, Yuankai
    Landman, Bennett A.
    Tang, Yucheng
    MEDICAL IMAGE ANALYSIS, 2023, 90
  • [5] Radial awareness with adaptive hybrid CNN-Transformer range-view representation for outdoor LiDAR point cloud semantic segmentation
    He, Xiang
    Li, Xu
    Xu, Qimin
    Hu, Yue
    Sun, Zhengliang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271
  • [6] Cascaded CNN and global-local attention transformer network-based semantic segmentation for high-resolution remote sensing image
    Liu, Xiaohui
    Zhang, Lei
    Wang, Rui
    Li, Xiaoyu
    Xu, Jiyang
    Lu, Xiaochen
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [7] Cascaded CNN and global-local attention transformer network-based semantic segmentation for high-resolution remote sensing image
    Liu, Xiaohui
    Zhang, Lei
    Wang, Rui
    Li, Xiaoyu
    Xu, Jiyang
    Lu, Xiaochen
    Journal of Applied Remote Sensing, 2024, 18 (03)
  • [8] ResU-Former: Advancing Remote Sensing Image Segmentation with Swin Residual Transformer for Precise Global-Local Feature Recognition and Visual-Semantic Space Learning
    Li, Hanlu
    Li, Lei
    Zhao, Liangyu
    Liu, Fuxiang
    ELECTRONICS, 2024, 13 (02)
  • [9] G2LL: GLOBAL-TO-LOCAL SELF-SUPERVISED LEARNING FOR LABEL-EFFICIENT TRANSFORMER-BASED SKIN LESION SEGMENTATION IN DERMOSCOPY IMAGES
    Chen, Fei
    Wang, Jiacheng
    Magnier, Baptiste
    Xue, Wei
    Huang, Shaohui
    Wang, Liansheng
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,