VANet: a medical image fusion model based on attention mechanism to assist disease diagnosis

Cited by: 0
Authors
Guo, Kai [2 ,3 ]
Li, Xiongfei [2 ,3 ]
Fan, Tiehu [1 ]
Hu, Xiaohan [4 ]
Affiliations
[1] Jilin Univ, Coll Instrumentat & Elect Engn, Changchun, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
[4] First Hosp Jilin Univ, Dept Radiol, Changchun, Peoples R China
Funding
Industrial Technology Research and Development Fund Project; National Natural Science Foundation of China;
Keywords
Medical image; Medical image fusion; Attention mechanism; Contextual information; Multi-scale feature extraction;
DOI
10.1186/s12859-022-05072-4
Chinese Library Classification
Q5 [Biochemistry];
Subject Classification Codes
071010; 081704;
Abstract
Background: Modern biomedical imaging technology can present the morphological structure or functional metabolic information of organisms at different scales, such as organ, tissue, cell, molecule, and gene. However, different imaging modalities have different application scopes, advantages, and disadvantages. To strengthen the role of medical images in disease diagnosis, fusing biomedical image information across imaging modalities and scales has become an important research direction in medical imaging. Traditional medical image fusion methods are designed around manually specified activity-level measurements and fusion rules; they do not mine the contextual features of images from different modalities, which limits the quality of the fused images.
Method: This paper proposes an attention-multiscale network medical image fusion model based on contextual features. The model selects five backbone modules of the VGG-16 network to build encoders that extract the contextual features of medical images. It builds an attention-mechanism branch to fuse global contextual features and designs a residual multiscale detail-processing branch to fuse local contextual features. Finally, a decoder performs cascade reconstruction of the features to obtain the fused image.
Results: Ten sets of images covering five diseases are selected from the AANLIB database to validate the VANet model. Structural images are derived from high-resolution MR images, and functional images are derived from SPECT and PET images, which are good at describing organ blood flow and tissue metabolism. Fusion experiments are performed with twelve fusion algorithms, including the VANet model. Eight metrics covering different aspects are selected to build a fusion quality evaluation system for assessing the fused images. Friedman's test and the post-hoc Nemenyi test are introduced to provide rigorous statistical evidence of the VANet model's superiority.
Conclusions: The VANet model fully captures and fuses the texture details and color information of the source images. In the fusion results, metabolic and structural information is well expressed, and the color information does not interfere with structure or texture. In terms of the objective evaluation system, the metric values of the VANet model are generally higher than those of the other methods; in terms of efficiency, the model's time consumption is acceptable; in terms of scalability, the model is unaffected by the input order of the source images and can be extended to tri-modal fusion.
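The Method paragraph above describes a pipeline with an encoder, two fusion branches (global attention and residual multiscale detail), and a decoder. The following PyTorch code is a minimal, hypothetical sketch of such a pipeline, assuming the five convolutional blocks of VGG-16 as a shared encoder, a channel-attention branch for global context, a residual branch with parallel 3x3 and 5x5 convolutions for local detail, and a small convolutional decoder. The class name VANetSketch, all channel sizes, and the element-wise addition used to merge the two encoded modalities are illustrative assumptions, not the paper's exact design.

import torch
import torch.nn as nn
from torchvision.models import vgg16


class VANetSketch(nn.Module):
    """Illustrative fusion network: encoder + attention branch + detail branch + decoder."""

    def __init__(self):
        super().__init__()
        # Encoder: the five convolutional blocks of VGG-16 (through conv5_3),
        # giving 512-channel contextual features at 1/16 resolution.
        self.encoder = nn.Sequential(*vgg16(weights=None).features[:30])

        # Global-context branch: channel attention (squeeze-and-excitation style).
        self.attention = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(512, 64, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 512, kernel_size=1), nn.Sigmoid(),
        )

        # Local-detail branch: residual block with parallel 3x3 / 5x5 convolutions.
        self.detail3 = nn.Conv2d(512, 256, kernel_size=3, padding=1)
        self.detail5 = nn.Conv2d(512, 256, kernel_size=5, padding=2)

        # Decoder: upsample the concatenated branch outputs and reconstruct
        # a single-channel fused image.
        self.decoder = nn.Sequential(
            nn.Upsample(scale_factor=16, mode="bilinear", align_corners=False),
            nn.Conv2d(1024, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 1, kernel_size=3, padding=1), nn.Sigmoid(),
        )

    def forward(self, img_a: torch.Tensor, img_b: torch.Tensor) -> torch.Tensor:
        # Encode each modality and merge by element-wise addition
        # (a placeholder for the paper's actual fusion strategy).
        feat = self.encoder(img_a) + self.encoder(img_b)

        global_feat = feat * self.attention(feat)            # global contextual features
        local_feat = feat + torch.cat(                       # local contextual features (residual)
            [torch.relu(self.detail3(feat)), torch.relu(self.detail5(feat))], dim=1)

        # Cascade (concatenate) the two branches and reconstruct the fused image.
        return self.decoder(torch.cat([global_feat, local_feat], dim=1))


# Example: fuse a registered MR/PET slice pair of size 256 x 256.
model = VANetSketch().eval()
mr = torch.rand(1, 3, 256, 256)    # grayscale MR replicated to 3 channels
pet = torch.rand(1, 3, 256, 256)   # PET pseudo-color image
with torch.no_grad():
    fused = model(mr, pet)         # -> tensor of shape (1, 1, 256, 256)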
Pages: 32
Related Papers
50 records in total
  • [41] A Multimodal Fusion Model Based on Hybrid Attention Mechanism for Gesture Recognition
    Li, Yajie
    Chen, Yiqiang
    Gu, Yang
    Ouyang, Jianquan
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 302 - 312
  • [42] Small object detection model based on feature fusion of attention mechanism
    Chen H.
    Zhen X.
    Zhao T.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (03): 60 - 66
  • [43] A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
    Cui, Jianguo
    Wang, Liejun
    Jiang, Shaochen
    APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [44] Medical Image Classification Algorithm Based on Visual Attention Mechanism-MCNN
    An, Fengping
    Li, Xiaowei
    Ma, Xingmin
    OXIDATIVE MEDICINE AND CELLULAR LONGEVITY, 2021, 2021
  • [45] PAMSNet: A medical image segmentation network based on spatial pyramid and attention mechanism
    Feng, Yuncong
    Zhu, Xiaoyan
    Zhang, Xiaoli
    Li, Yang
    Lu, Huimin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [46] DRCM: a disentangled representation network based on coordinate and multimodal attention for medical image fusion
    Huang, Wanwan
    Zhang, Han
    Cheng, Yu
    Quan, Xiongwen
    FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [47] Infrared and visible image fusion network based on low-light image enhancement and attention mechanism
    Lu, Jinbo
    Pei, Zhen
    Chen, Jinling
    Tan, Kunyu
    Ran, Qi
    Wang, Hongyan
    SIGNAL, IMAGE AND VIDEO PROCESSING, 2025, 19 (6)
  • [48] Towards accurate diagnosis: exploring knowledge distillation and self-attention in multimodal medical image fusion
    Radhika, P.
    Bobby, J. Sofia
    Francis, Sheeja V.
    Femina, M. A.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024,
  • [49] CRFNet: A Medical Image Segmentation Method Using the Cross Attention Mechanism and Refined Feature Fusion Strategy
    Ma, Chengyun
    Tian, Shengwei
    Yu, Long
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 247 - 260
  • [50] FusionGRAM: An Infrared and Visible Image Fusion Framework Based on Gradient Residual and Attention Mechanism
    Wang, Jinxin
    Xi, Xiaoli
    Li, Dongmei
    Li, Fang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72