VANet: a medical image fusion model based on attention mechanism to assist disease diagnosis

Cited by: 0
Authors
Guo, Kai [2 ,3 ]
Li, Xiongfei [2 ,3 ]
Fan, Tiehu [1 ]
Hu, Xiaohan [4 ]
Affiliations
[1] Jilin Univ, Coll Instrumentat & Elect Engn, Changchun, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
[4] First Hosp Jilin Univ, Dept Radiol, Changchun, Peoples R China
Funding
Industrial Technology Research and Development Fund Project; National Natural Science Foundation of China;
Keywords
Medical image; Medical image fusion; Attention mechanism; Contextual information; Multi-scale feature extraction;
DOI
10.1186/s12859-022-05072-4
Chinese Library Classification
Q5 [Biochemistry];
Subject Classification Codes
071010; 081704;
Abstract
Background: Modern biomedical imaging technology can present the morphological structure or functional metabolic information of organisms at different scales, such as organ, tissue, cell, molecule, and gene. However, different imaging modalities have different application scopes, advantages, and disadvantages. To strengthen the role of medical images in disease diagnosis, fusing biomedical image information across imaging modalities and scales has become an important research direction in medical imaging. Traditional medical image fusion methods are designed around manually specified activity-level measurements and fusion rules; they do not mine the contextual features of images from different modalities, which limits the quality of the fused images.
Method: This paper proposes an attention-multiscale network medical image fusion model based on contextual features. The model selects five backbone modules of the VGG-16 network to build encoders that extract the contextual features of medical images. It builds an attention-mechanism branch to fuse global contextual features and designs a residual multiscale detail-processing branch to fuse local contextual features. Finally, a decoder performs cascade reconstruction of the features to obtain the fused image.
Results: Ten sets of images covering five diseases are selected from the AANLIB database to validate the VANet model. Structural images are derived from high-resolution MR images, and functional images are derived from SPECT and PET images, which are good at describing organ blood flow and tissue metabolism. Fusion experiments are performed with twelve fusion algorithms, including the VANet model. Eight metrics covering different aspects are selected to build a fusion quality evaluation system for assessing the fused images. Friedman's test and the post-hoc Nemenyi test are introduced to provide rigorous statistical evidence of the VANet model's superiority.
Conclusions: The VANet model fully captures and fuses the texture details and color information of the source images. In the fusion results, metabolic and structural information is well expressed, and the color information does not interfere with structure or texture. In terms of the objective evaluation system, the metric values of the VANet model are generally higher than those of the other methods; in terms of efficiency, the model's time consumption is acceptable; in terms of scalability, the model is unaffected by the input order of the source images and can be extended to tri-modal fusion.
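The Method paragraph above describes a pipeline with an encoder, two fusion branches (global attention and residual multiscale detail), and a decoder. The following PyTorch code is a minimal, hypothetical sketch of such a pipeline, assuming the five convolutional blocks of VGG-16 as a shared encoder, a channel-attention branch for global context, a residual branch with parallel 3x3 and 5x5 convolutions for local detail, and a small convolutional decoder. The class name VANetSketch, all channel sizes, and the element-wise addition used to merge the two encoded modalities are illustrative assumptions, not the paper's exact design.

import torch
import torch.nn as nn
from torchvision.models import vgg16


class VANetSketch(nn.Module):
    """Illustrative fusion network: encoder + attention branch + detail branch + decoder."""

    def __init__(self):
        super().__init__()
        # Encoder: the five convolutional blocks of VGG-16 (through conv5_3),
        # giving 512-channel contextual features at 1/16 resolution.
        self.encoder = nn.Sequential(*vgg16(weights=None).features[:30])

        # Global-context branch: channel attention (squeeze-and-excitation style).
        self.attention = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(512, 64, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 512, kernel_size=1), nn.Sigmoid(),
        )

        # Local-detail branch: residual block with parallel 3x3 / 5x5 convolutions.
        self.detail3 = nn.Conv2d(512, 256, kernel_size=3, padding=1)
        self.detail5 = nn.Conv2d(512, 256, kernel_size=5, padding=2)

        # Decoder: upsample the concatenated branch outputs and reconstruct
        # a single-channel fused image.
        self.decoder = nn.Sequential(
            nn.Upsample(scale_factor=16, mode="bilinear", align_corners=False),
            nn.Conv2d(1024, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 1, kernel_size=3, padding=1), nn.Sigmoid(),
        )

    def forward(self, img_a: torch.Tensor, img_b: torch.Tensor) -> torch.Tensor:
        # Encode each modality and merge by element-wise addition
        # (a placeholder for the paper's actual fusion strategy).
        feat = self.encoder(img_a) + self.encoder(img_b)

        global_feat = feat * self.attention(feat)            # global contextual features
        local_feat = feat + torch.cat(                       # local contextual features (residual)
            [torch.relu(self.detail3(feat)), torch.relu(self.detail5(feat))], dim=1)

        # Cascade (concatenate) the two branches and reconstruct the fused image.
        return self.decoder(torch.cat([global_feat, local_feat], dim=1))


# Example: fuse a registered MR/PET slice pair of size 256 x 256.
model = VANetSketch().eval()
mr = torch.rand(1, 3, 256, 256)    # grayscale MR replicated to 3 channels
pet = torch.rand(1, 3, 256, 256)   # PET pseudo-color image
with torch.no_grad():
    fused = model(mr, pet)         # -> tensor of shape (1, 1, 256, 256)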
Pages: 32
Related Papers
50 records in total
  • [41] A Multimodal Fusion Model Based on Hybrid Attention Mechanism for Gesture Recognition
    Li, Yajie
    Chen, Yiqiang
    Gu, Yang
    Ouyang, Jianquan
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 302 - 312
  • [42] Small object detection model based on feature fusion of attention mechanism
    Chen H.
    Zhen X.
    Zhao T.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (03): 60 - 66
  • [43] A Multi-Scale Cross-Fusion Medical Image Segmentation Network Based on Dual-Attention Mechanism Transformer
    Cui, Jianguo
    Wang, Liejun
    Jiang, Shaochen
    APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [44] Medical Image Classification Algorithm Based on Visual Attention Mechanism-MCNN
    An, Fengping
    Li, Xiaowei
    Ma, Xingmin
    OXIDATIVE MEDICINE AND CELLULAR LONGEVITY, 2021, 2021
  • [45] PAMSNet: A medical image segmentation network based on spatial pyramid and attention mechanism
    Feng, Yuncong
    Zhu, Xiaoyan
    Zhang, Xiaoli
    Li, Yang
    Lu, Huimin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [46] DRCM: a disentangled representation network based on coordinate and multimodal attention for medical image fusion
    Huang, Wanwan
    Zhang, Han
    Cheng, Yu
    Quan, Xiongwen
    FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [47] Infrared and visible image fusion network based on low-light image enhancement and attention mechanism
    Lu, Jinbo
    Pei, Zhen
    Chen, Jinling
    Tan, Kunyu
    Ran, Qi
    Wang, Hongyan
    SIGNAL, IMAGE AND VIDEO PROCESSING, 2025, 19 (6)
  • [48] Towards accurate diagnosis: exploring knowledge distillation and self-attention in multimodal medical image fusion
    Radhika, P.
    Bobby, J. Sofia
    Francis, Sheeja V.
    Femina, M. A.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024,
  • [49] CRFNet: A Medical Image Segmentation Method Using the Cross Attention Mechanism and Refined Feature Fusion Strategy
    Ma, Chengyun
    Tian, Shengwei
    Yu, Long
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 247 - 260
  • [50] FusionGRAM: An Infrared and Visible Image Fusion Framework Based on Gradient Residual and Attention Mechanism
    Wang, Jinxin
    Xi, Xiaoli
    Li, Dongmei
    Li, Fang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72