WaveFusionNet: Infrared and visible image fusion based on multi-scale feature encoder-decoder and discrete wavelet decomposition

被引:0
|
作者
Liu, Renhe [1 ]
Liu, Yu [1 ]
Wang, Han [1 ]
Du, Shan [2 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Univ British Columbia, Dept Comp Sci Math Phys & Stat, Okanagan Campus, Kelowna, BC V1V 1V7, Canada
关键词
Infrared and visible image fusion; Frequency feature decomposition; Discrete wavelet transform; Multi-scale encoder; Dual-band feature fusion; QUALITY ASSESSMENT; TRANSFORM; FRAMEWORK; NETWORK; NEST;
D O I
10.1016/j.optcom.2024.131024
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
To merge complementary information from multimodal images, such as thermal saliency from infrared images and texture details from visible images, traditional multi-scale transform-based methods have been extensively studied, with deep learning-based methods gaining significant popularity in recent years. However, there has been limited research on optimally combining the advantages of these two categories in fusion. In this paper, we propose a novel infrared and visible image fusion (IVIF) framework, WaveFusionNet, which integrates precise frequency feature decomposition from the discrete wavelet transform (DWT) with the comprehensive feature extraction from the multi-scale encoder. Firstly, we train an encoder-decoder network for multi- scale feature extraction and image reconstruction. DWT is used for down-sampling with minimal information loss by decomposing extracted features into low and high-frequency sub-bands. Next, a dual-band feature fusion (DBFF) module is trained to merge these sub-bands by integrating a spatial feature transform-based sub-network for low-frequency fusion and a maximum absolute value selection strategy for fusing high- frequencies. Finally, all fused sub-bands are fed into the pre-trained decoder to reconstruct the final image. Experimental results on three benchmark datasets (TNO, Roadscene, and MSRS) demonstrate that the proposed fusion method outperforms recent IVIF methods in both quantitative assessment and visual perception while maintaining competitive time complexity.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A VISIBLE AND INFRARED IMAGE FUSION FRAMEWORK BASED ON DUAL-PATH ENCODER-DECODER AND MULTI-SCALE DISCRETE WAVELET TRANSFORM
    Liu, Renhe
    Wang, Han
    Du, Shan
    Liu, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1995 - 1999
  • [2] CUFD: An encoder-decoder network for visible and infrared image fusion based on common and unique feature decomposition
    Xu, Han
    Gong, Meiqi
    Tian, Xin
    Huang, Jun
    Ma, Jiayi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 218
  • [3] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
    Ma, Xinxin
    Liu, Kai
    Ding, Chongyang
    Yan, Lin
    Duan, Meiyu
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [4] VISIBLE AND INFRARED IMAGE FUSION USING ENCODER-DECODER NETWORK
    Ataman, Ferhat Can
    Bozdagi Akar, Gozde
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1779 - 1783
  • [5] Image deblurring via multi-scale feature fusion and multi-input multi-output encoder-decoder
    Zhao Q.
    Zhou D.
    Yang H.
    Wang C.
    Li M.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2022, 51 (10):
  • [6] Multi-scale fusion residual encoder-decoder approach for low illumination image enhancement
    Pan Xiaoying
    Wei Miao
    Wang Hao
    Jia Fengzhu
    The Journal of China Universities of Posts and Telecommunications, 2022, (02) : 63 - 72
  • [7] A Multi-Scale Fusion Residual Encoder-Decoder Approach for Low Illumination Image Enhancement
    Pan X.
    Wei M.
    Wang H.
    Jia F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (01): : 104 - 112
  • [8] Multi-scale fusion residual encoder-decoder approach for low illumination image enhancement
    Xiaoying P.
    Miao W.
    Hao W.
    Fengzhü J.
    Journal of China Universities of Posts and Telecommunications, 2022, 29 (02): : 63 - 72
  • [9] Infrared and visual image fusion based on multi-scale feature decomposition
    Yan, Huibin
    Li, Zhongmin
    OPTIK, 2020, 203
  • [10] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE ENCODER-DECODER NETWORK
    Ma, Jingjing
    Wu, Linlin
    Tang, Xu
    Zhang, Xiangrong
    Zhu, Cheng
    Ma, Junyong
    Jiao, Licheng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1283 - 1286