Multiple attention channels aggregated network for multimodal medical image fusion

被引:0
|
作者
Huang, Jingxue [1 ]
Tan, Tianshu [2 ]
Li, Xiaosong [1 ,3 ,4 ]
Ye, Tao [5 ]
Wu, Yanxiong [1 ]
机构
[1] Foshan Univ, Sch Phys & Optoelect Engn, Foshan 528225, Peoples R China
[2] Hong Kong Univ Sci & Technol, Sch Engn, Kowloon, Hong Kong, Peoples R China
[3] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspect Tec, Foshan, Peoples R China
[4] Foshan Univ, Guangdong HongKong Macao Joint Lab Intelligent Mic, Foshan, Peoples R China
[5] China Univ Min & Technol Beijing, Sch Mech Elect Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
attention interaction; multimodal medical image fusion; multiscale features; QUALITY ASSESSMENT; ALGORITHM;
D O I
10.1002/mp.17607
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BackgroundIn clinical practices, doctors usually need to synthesize several single-modality medical images for diagnosis, which is a time-consuming and costly process. With this background, multimodal medical image fusion (MMIF) techniques have emerged to synthesize medical images of different modalities, providing a comprehensive and objective interpretation of the lesion.PurposeAlthough existing MMIF approaches have shown promising results, they often overlook the importance of multiscale feature diversity and attention interaction, which are essential for superior visual outcomes. This oversight can lead to diminished fusion performance. To bridge the gaps, we introduce a novel approach that emphasizes the integration of multiscale features through a structured decomposition and attention interaction.MethodsOur method first decomposes the source images into three distinct groups of multiscale features by stacking different numbers of diverse branch blocks. Then, to extract global and local information separately for each group of features, we designed the convolutional and Transformer block attention branch. These two attention branches make full use of channel and spatial attention mechanisms and achieve attention interaction, enabling the corresponding feature channels to fully capture local and global information and achieve effective inter-block feature aggregation.ResultsFor the MRI-PET fusion type, MACAN achieves average improvements of 24.48%, 27.65%, 19.24%, 27.32%, 18.51%, and 10.33% over the compared methods in terms of Qcb, AG, SSIM, SF, Qabf, and VIF metrics, respectively. Similarly, for the MRI-SPECT fusion type, MACAN outperforms the compared methods with average improvements of 29.13%, 26.43%, 18.20%, 27.71%, 16.79%, and 10.38% in the same metrics. In addition, our method demonstrates promising results in segmentation experiments. Specifically, for the T2-T1ce fusion, it achieves a Dice coefficient of 0.60 and a Hausdorff distance of 15.15. Comparable performance is observed for the Flair-T1ce fusion, with a Dice coefficient of 0.60 and a Hausdorff distance of 13.27.ConclusionThe proposed multiple attention channels aggregated network (MACAN) can effectively retain the complementary information from source images. The evaluation of MACAN through medical image fusion and segmentation experiments on public datasets demonstrated its superiority over the state-of-the-art methods, both in terms of visual quality and objective metrics. Our code is available at https://github.com/JasonWong30/MACAN.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Multimodal Medical Image Fusion Based on Multichannel Aggregated Network
    Huang, Jingxue
    Li, Xiaosong
    Tan, Haishu
    Cheng, Xiaoqi
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14359 LNCS : 14 - 25
  • [2] DRCM: a disentangled representation network based on coordinate and multimodal attention for medical image fusion
    Huang, Wanwan
    Zhang, Han
    Cheng, Yu
    Quan, Xiongwen
    FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [3] Multimodal parallel attention network for medical image segmentation
    Wang, Zhibing
    Wang, Wenmin
    Li, Nannan
    Zhang, Shenyong
    Chen, Qi
    Jiang, Zhe
    IMAGE AND VISION COMPUTING, 2024, 147
  • [4] MAAFusion: A Multimodal Medical Image Fusion Network Via Arbitrary Kernel Convolution And Attention Mechanism
    Wang, Wenqing
    He, Ji
    Li, Lingzhou
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [5] FFANet: Feature fusion attention network to medical image segmentation
    Yu, Jiankang
    Yang, Dedong
    Zhao, Hanshuo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 69 (69)
  • [6] A multiscale residual pyramid attention network for medical image fusion
    Fu, Jun
    Li, Weisheng
    Du, Jiao
    Huang, Yuping
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 66
  • [7] MMIF-INet: Multimodal medical image fusion by invertible network
    He, Dan
    Li, Weisheng
    Wang, Guofen
    Huang, Yuping
    Liu, Shiqiang
    INFORMATION FUSION, 2025, 114
  • [8] Multimodal Medical Image Fusion Network Based on Target Information Enhancement
    Zhou, Yuting
    Yang, Xuemei
    Liu, Shiqi
    Yin, Junping
    IEEE ACCESS, 2024, 12 : 70851 - 70869
  • [9] Hierarchical Progressive Network for Multimodal Medical Image Fusion in Healthcare Systems
    Yang, Sihan
    Yang, Xiaomin
    Zhang, Rongzhu
    Liu, Kai
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (04) : 1540 - 1558
  • [10] IGNFusion: An Unsupervised Information Gate Network for Multimodal Medical Image Fusion
    Wang, Chengchao
    Nie, Rencan
    Cao, Jinde
    Wang, Xue
    Zhang, Ying
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (04) : 854 - 868