D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images

Citations: 0
Authors
Bougourzi F. [1]
Dornaika F. [3,4]
Distante C. [2]
Taleb-Ahmed A. [5]
Affiliations
[1] Junia, UMR 8520, CNRS, Centrale Lille, University of Polytechnique Hauts-de-France, Lille
[2] Institute of Applied Sciences and Intelligent Systems, National Research Council of Italy, Lecce
[3] University of the Basque Country UPV/EHU, San Sebastian
[4] IKERBASQUE, Basque Foundation for Science, Bilbao
[5] Université Polytechnique Hauts-de-France, Université de Lille, CNRS, Valenciennes, Hauts-de-France
Keywords
Bone Metastasis; Convolutional Neural Network; Covid-19; Deep learning; Segmentation; Transformer; Unet
DOI
10.1016/j.compbiomed.2024.108590
Abstract
Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complex diseases become more common and case numbers rise, machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, which is difficult even for experienced radiologists. This difficulty highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. The architecture has an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to segment lesions and organs simultaneously and to integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it, without the second decoder, on the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in the segmentation of Covid-19 infections and bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. © 2024 The Author(s)
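The abstract states that the two decoders segment lesions and organs and that their segmentation losses are integrated, but it does not give the loss formula. The following is a minimal sketch of one plausible way to combine them, not the authors' implementation. The choice of binary cross-entropy plus Dice for each decoder, the weighted sum, and the names `bce_loss`, `dice_loss`, `dual_decoder_loss`, and `organ_weight` are all assumptions made for illustration.

```python
import math


def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over flattened pixel probabilities."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(probs)


def dice_loss(probs, targets, smooth=1.0):
    """Soft Dice loss: 1 - (2*|P.T| + s) / (|P| + |T| + s)."""
    intersection = sum(p * t for p, t in zip(probs, targets))
    denominator = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + smooth) / (denominator + smooth)


def dual_decoder_loss(lesion_probs, lesion_mask,
                      organ_probs, organ_mask, organ_weight=0.5):
    """Combine the losses of the two decoders into one training objective.

    Assumption: each decoder is scored with BCE + Dice, and the organ
    term is scaled by a hypothetical weight `organ_weight`.
    """
    lesion = bce_loss(lesion_probs, lesion_mask) + dice_loss(lesion_probs, lesion_mask)
    organ = bce_loss(organ_probs, organ_mask) + dice_loss(organ_probs, organ_mask)
    return lesion + organ_weight * organ


if __name__ == "__main__":
    # Toy 4-pixel example: predicted probabilities and ground-truth masks.
    lesion_probs, lesion_mask = [0.9, 0.2, 0.1, 0.8], [1, 0, 0, 1]
    organ_probs, organ_mask = [0.7, 0.6, 0.3, 0.9], [1, 1, 0, 1]
    print(dual_decoder_loss(lesion_probs, lesion_mask, organ_probs, organ_mask))
```

Setting `organ_weight=0` reduces the objective to the lesion loss alone, which loosely mirrors the single-decoder setting the abstract describes for gland and nuclei segmentation.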
Related papers
50 in total
  • [31] Weak Appearance Aware Pipeline Leak Detection based on CNN-Transformer Hybrid Architecture
    Zhang, Bulin
    Yuan, Haiwen
    Ge, Jie
    Cheng, Li
    Li, Xuan
    Xiao, Changshi
    IEEE Transactions on Instrumentation and Measurement, 2024,
  • [32] Add-Vit: CNN-Transformer Hybrid Architecture for Small Data Paradigm Processing
    Chen, Jinhui
    Wu, Peng
    Zhang, Xiaoming
    Xu, Renjie
    Liang, Jia
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [33] CC-TransXNet: a hybrid CNN-transformer network for automatic segmentation of optic cup and optic disk from fundus images
    Zhongzheng Yuan
    Jinke Wang
    Yukun Xu
    Min Xu
    Medical & Biological Engineering & Computing, 2025, 63 (4) : 1027 - 1044
  • [34] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [35] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
    Gao, Yunhe
    Zhou, Mu
    Metaxas, Dimitris N.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
  • [36] Agricultural innovation through deep learning: a hybrid CNN-Transformer architecture for crop disease classification
    Padshetty, Smitha
    Umashetty, Ambika
    JOURNAL OF SPATIAL SCIENCE, 2024,
  • [37] Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging
    Cao, Miao
    Wang, Lishun
    Zhu, Mingyu
    Yuan, Xin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 4521 - 4540
  • [38] Cross Attention Multi Scale CNN-Transformer Hybrid Encoder Is General Medical Image Learner
    Zhou, Rongzhou
    Yao, Junfeng
    Hong, Qingqi
    Li, Xingxin
    Cao, Xianpeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 85 - 97
  • [39] SWFormer: A scale-wise hybrid CNN-Transformer network for multi-classes weed segmentation
    Jiang, Hongkui
    Chen, Qiupu
    Wang, Rujing
    Du, Jianming
    Chen, Tianjiao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [40] MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation
    Xie, Shiao
    Huang, Huimin
    Niu, Ziwei
    Lin, Lanfen
    Chen, Yen-Wei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1913 - 1918