Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism

被引:0
|
作者
Addo, Addae Emmanuel [1 ]
Gedeon, Kashala Kabe [1 ,2 ]
Liu, Zhe [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Peoples R China
[2] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Peoples R China
来源
IMAGING SCIENCE JOURNAL | 2024年 / 72卷 / 01期
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Transformers; unet; heart-segmentation; long range dependencies; spatial encoding; positional encoding; axial attention; computed tomography (CT);
D O I
10.1080/13682199.2023.2198394
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Recent research on transformer-based models have highlighted particular methods for medical image segmentation. Additionally, the majority of transformer-based network designs used in computer vision applications have a significant number of parameters and demand extensive training datasets. Inspired by the success of transformers in recent researches, the unet-transformer approach has become one of the de-facto ideas in overcoming the above challenges. In this manuscript, a novel unet-transformer approach was proposed for heart image segmentation to solve parameters, limited dataset, over segmentation, sensitivity noise and higher training time problems. A framework in which a novel width and height wise axial attention mechanism is incorporated into the design to effectively give positional embeddings and encode spatial flattening. Furthermore, a novel local and global spatial attention mechanism is proposed to effectively learn the local and global interactions between encoder features. Finally, we introduce a mechanism to fuse both contexts for better feature representation and preparation into the decoder. The results demonstrate that our prototype provides a robust novel axial-attention mechanism.
引用
收藏
页码:121 / 139
页数:19
相关论文
共 50 条
  • [21] Diabetic Foot Ulcer Segmentation Using Convolutional and Transformer-Based Models
    Hassib, Mariam
    Ali, Maram
    Mohamed, Amina
    Torki, Marwan
    Hussein, Mohamed
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13797 LNCS : 83 - 91
  • [22] A transformer-based approach empowered by a self-attention technique for semantic segmentation in remote sensing
    Boulila, Wadii
    Ghandorh, Hamza
    Masood, Sharjeel
    Alzahem, Ayyub
    Koubaa, Anis
    Ahmed, Fawad
    Khan, Zahid
    Ahmad, Jawad
    HELIYON, 2024, 10 (08)
  • [23] Temporal fusion transformer-based prediction in aquaponics
    Metin, Ahmet
    Kasif, Ahmet
    Catal, Cagatay
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (17): : 19934 - 19958
  • [24] TBiSeg: A transformer-based network with bi-level routing attention for inland waterway segmentation
    Fu, Chuanmao
    Li, Meng
    Zhang, Bo
    OCEAN ENGINEERING, 2024, 311
  • [25] Temporal fusion transformer-based prediction in aquaponics
    Ahmet Metin
    Ahmet Kasif
    Cagatay Catal
    The Journal of Supercomputing, 2023, 79 : 19934 - 19958
  • [26] Transformer-based Dynamic Fusion Clustering Network
    Zhang, Chunchun
    Zhao, Yaliang
    Wang, Jinke
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [27] Panoptic segmentation network based on fusion coding and attention mechanism
    Zhang, Jiarui
    Tian, Penghui
    Cognitive Robotics, 2022, 2 : 186 - 192
  • [28] Nonintrusive Load Monitoring (NILM) Using a Deep Learning Model with a Transformer-Based Attention Mechanism and Temporal Pooling
    Azad, Mohammad Irani
    Rajabi, Roozbeh
    Estebsari, Abouzar
    ELECTRONICS, 2024, 13 (02)
  • [29] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
  • [30] Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation
    Cam Nguyen
    Asad, Zuhayr
    Deng, Ruining
    Huo, Yuankai
    MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032