CABnet: A channel attention dual adversarial balancing network for multimodal image fusion

被引:3
|
作者
Sun, Le [1 ]
Tang, Mengqi [1 ]
Muhammad, Ghulam [2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Ctr Atmospher Environm & Equipment Technol CICAEET, Dept Jiangsu Collaborat Innovat, Nanjing 210044, Jiangsu, Peoples R China
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia
关键词
Image processing; Infrared and visible image fusion; Complementary information extract; Generative adversarial networks; Adaptive factor; ENSEMBLE;
D O I
10.1016/j.imavis.2024.105065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Infrared and visible image fusion aims to generate informative images by leveraging the distinctive strengths of infrared and visible modalities. These fused images play a crucial role in subsequent downstream tasks, including object detection, recognition, and segmentation. However, complementary information is often difficult to extract. Existing generative adversarial network-based methods generate fused images by modifying the distribution of source images' features to preserve instances and texture details in both infrared and visible images. Nevertheless, these approaches may result in a degradation of the fused image quality when the original image quality is low. Considering the balance of information from different modalities can improve the quality of the fused image. Hence, we introduce CABnet, a Channel Attention dual adversarial Balancing network. CABnet incorporates a channel attention mechanism to capture crucial channel features, thereby, enhancing complementary information. It also employs an adaptive factor to control the mixing distribution of infrared and visible images, which ensures the preservation of instances and texture details during the adversarial process. To enhance efficiency and reduce reliance on manual labeling, our training process adopts a semi-supervised learning strategy. Through qualitative and quantitative experiments across multiple datasets, CABnet surpasses existing state-of-the-art methods in fusion performance, notably achieving a 51.3% enhancement in signal to noise ratio and a 13.4% improvement in standard deviation.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Multimodal Fusion Generative Adversarial Network for Image Synthesis
    Zhao, Liang
    Hu, Qinghao
    Li, Xiaoyuan
    Zhao, Jingyuan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1865 - 1869
  • [2] DMMFnet: A Dual-Branch Multimodal Medical Image Fusion Network Using Super Token and Channel-Spatial Attention
    Zhang, Yukun
    Wang, Lei
    Tahir, Muhammad
    Huang, Zizhen
    Han, Yaolong
    Yang, Shanliang
    Liu, Shilong
    Saeed, Muhammad Imran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 696 - 705
  • [3] Boosting attention fusion generative adversarial network for image denoising
    Qiongshuai Lyu
    Min Guo
    Miao Ma
    Neural Computing and Applications, 2021, 33 : 4833 - 4847
  • [4] Boosting attention fusion generative adversarial network for image denoising
    Lyu, Qiongshuai
    Guo, Min
    Ma, Miao
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 4833 - 4847
  • [5] Infrared and Visible Image Fusion Method Based on Dual-Channel Generative Adversarial Network
    Hou Chunping
    Wang Xiaocong
    Xia Han
    Yang Yang
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (14)
  • [6] DSAGAN: A generative adversarial network based on dual-stream attention mechanism for anatomical and functional image fusion
    Fu, Jun
    Li, Weisheng
    Du, Jiao
    Xu, Liming
    INFORMATION SCIENCES, 2021, 576 : 484 - 506
  • [7] DSAGAN: A generative adversarial network based on dual-stream attention mechanism for anatomical and functional image fusion
    Fu, Jun
    Li, Weisheng
    Du, Jiao
    Xu, Liming
    Li, Weisheng (liws@cqupt.edu.cn), 1600, Elsevier Inc. (576): : 484 - 506
  • [8] Multimodal medical image fusion combining saliency perception and generative adversarial network
    Albekairi, Mohammed
    Mohamed, Mohamed vall O.
    Kaaniche, Khaled
    Abbas, Ghulam
    Alanazi, Meshari D.
    Alanazi, Turki M.
    Emara, Ahmed
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [9] Multiple attention channels aggregated network for multimodal medical image fusion
    Huang, Jingxue
    Tan, Tianshu
    Li, Xiaosong
    Ye, Tao
    Wu, Yanxiong
    MEDICAL PHYSICS, 2024,
  • [10] DAFCNN: A Dual-Channel Feature Extraction and Attention Feature Fusion Convolution Neural Network for SAR Image and MS Image Fusion
    Luo, Jiahao
    Zhou, Fang
    Yang, Jun
    Xing, Mengdao
    REMOTE SENSING, 2023, 15 (12)