Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism

Cited by: 0
Authors
Addo, Addae Emmanuel [1]
Gedeon, Kashala Kabe [1, 2]
Liu, Zhe [1]
Affiliations
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Peoples R China
[2] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Peoples R China
Source
IMAGING SCIENCE JOURNAL | 2024 / Vol. 72 / Issue 01
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Transformers; unet; heart-segmentation; long range dependencies; spatial encoding; positional encoding; axial attention; computed tomography (CT);
DOI
10.1080/13682199.2023.2198394
Chinese Library Classification number
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
Recent research on transformer-based models has highlighted particular methods for medical image segmentation. Additionally, the majority of transformer-based network designs used in computer vision applications have a large number of parameters and demand extensive training datasets. Inspired by the success of transformers in recent research, the UNet-transformer approach has become one of the de facto ideas for overcoming these challenges. In this manuscript, a novel UNet-transformer approach is proposed for heart image segmentation to address the problems of parameter count, limited datasets, over-segmentation, sensitivity to noise, and long training time. The framework incorporates a novel width-wise and height-wise axial attention mechanism to effectively provide positional embeddings and encode spatial flattening. Furthermore, a novel local and global spatial attention mechanism is proposed to effectively learn the local and global interactions between encoder features. Finally, we introduce a mechanism to fuse both contexts for better feature representation before they are passed to the decoder. The results demonstrate that our prototype provides a robust novel axial-attention mechanism.
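The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of height-wise and width-wise axial attention with positional embeddings, not the authors' method. It assumes a single head, no learned query/key/value projections, and positional vectors simply added to the keys; the names `attend_1d`, `axial_attention`, `pos_h`, and `pos_w` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_1d(seq, pos):
    """Self-attention along one axis (a single row or column).

    seq: list of feature vectors; pos: one positional vector per position.
    Assumption: queries and values are the raw features, and keys are the
    features plus the positional vector (no learned projections).
    """
    dim = len(seq[0])
    scale = 1.0 / math.sqrt(dim)
    keys = [[x + p for x, p in zip(vec, pvec)] for vec, pvec in zip(seq, pos)]
    out = []
    for query in seq:
        scores = [scale * sum(a * b for a, b in zip(query, key)) for key in keys]
        weights = softmax(scores)
        # Each output vector is a weighted average of the values on this axis.
        out.append([sum(w * v[c] for w, v in zip(weights, seq)) for c in range(dim)])
    return out


def axial_attention(grid, pos_h, pos_w):
    """Apply height-wise attention, then width-wise attention.

    grid[h][w] is the feature vector at row h, column w.
    """
    height, width = len(grid), len(grid[0])
    # Height-wise pass: every column attends over its own rows.
    cols = [attend_1d([grid[h][w] for h in range(height)], pos_h)
            for w in range(width)]
    grid_h = [[cols[w][h] for w in range(width)] for h in range(height)]
    # Width-wise pass: every row attends over its own columns.
    return [attend_1d(row, pos_w) for row in grid_h]


# Toy 2 x 3 feature map with 2 channels (illustrative values only).
grid = [[[float(h + w), 1.0] for w in range(3)] for h in range(2)]
pos_h = [[0.1 * i, 0.0] for i in range(2)]
pos_w = [[0.0, 0.1 * j] for j in range(3)]
out = axial_attention(grid, pos_h, pos_w)
```

Attending along one axis at a time means each position compares itself with H + W others rather than all H × W positions, which is the usual motivation for axial attention when parameters and training data are limited.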
Pages: 121-139
Page count: 19