WaveFusionNet: Infrared and visible image fusion based on multi-scale feature encoder-decoder and discrete wavelet decomposition

Authors
Liu, Renhe [1]
Liu, Yu [1]
Wang, Han [1]
Du, Shan [2]
Affiliations
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Univ British Columbia, Dept Comp Sci Math Phys & Stat, Okanagan Campus, Kelowna, BC V1V 1V7, Canada
Keywords
Infrared and visible image fusion; Frequency feature decomposition; Discrete wavelet transform; Multi-scale encoder; Dual-band feature fusion; QUALITY ASSESSMENT; TRANSFORM; FRAMEWORK; NETWORK; NEST;
DOI
10.1016/j.optcom.2024.131024
Chinese Library Classification number
O43 [Optics]
Discipline classification codes
070207; 0803
Abstract
To merge complementary information from multimodal images, such as thermal saliency from infrared images and texture details from visible images, traditional multi-scale transform-based methods have been extensively studied, and deep learning-based methods have gained significant popularity in recent years. However, there has been limited research on how best to combine the advantages of these two categories in fusion. In this paper, we propose a novel infrared and visible image fusion (IVIF) framework, WaveFusionNet, which integrates the precise frequency feature decomposition of the discrete wavelet transform (DWT) with the comprehensive feature extraction of a multi-scale encoder. First, we train an encoder-decoder network for multi-scale feature extraction and image reconstruction. DWT is used for down-sampling with minimal information loss by decomposing the extracted features into low-frequency and high-frequency sub-bands. Next, a dual-band feature fusion (DBFF) module is trained to merge these sub-bands, combining a spatial feature transform-based sub-network for low-frequency fusion with a maximum absolute value selection strategy for high-frequency fusion. Finally, all fused sub-bands are fed into the pre-trained decoder to reconstruct the final image. Experimental results on three benchmark datasets (TNO, RoadScene, and MSRS) demonstrate that the proposed fusion method outperforms recent IVIF methods in both quantitative assessment and visual perception while maintaining competitive time complexity.
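The two classical ingredients named in the abstract, DWT decomposition into low- and high-frequency sub-bands and maximum absolute value selection for the high-frequency sub-bands, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes a single-level Haar wavelet, operates on plain 2D arrays rather than learned encoder features, and replaces the trained spatial feature transform sub-network for the low-frequency band with a simple average as a placeholder. All function names (`haar_dwt2`, `haar_idwt2`, `max_abs_select`, `fuse`) are hypothetical.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar DWT of an array with even height and width.

    Returns the low-frequency band and a tuple of three high-frequency bands.
    """
    a = x[0::2, 0::2]
    b = x[0::2, 1::2]
    c = x[1::2, 0::2]
    d = x[1::2, 1::2]
    ll = (a + b + c + d) / 2.0   # low-frequency approximation
    lh = (a + b - c - d) / 2.0   # difference between rows
    hl = (a - b + c - d) / 2.0   # difference between columns
    hh = (a - b - c + d) / 2.0   # diagonal difference
    return ll, (lh, hl, hh)


def haar_idwt2(ll, highs):
    """Inverse of haar_dwt2; reconstructs the full-resolution array."""
    lh, hl, hh = highs
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=np.float64)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


def max_abs_select(p, q):
    """Element-wise, keep whichever coefficient has the larger magnitude."""
    return np.where(np.abs(p) >= np.abs(q), p, q)


def fuse(ir, vis):
    """Fuse two same-sized arrays in the Haar wavelet domain.

    Assumption: the low-frequency band is averaged here only as a stand-in;
    the paper describes a learned sub-network for that band.
    """
    ll_ir, highs_ir = haar_dwt2(ir)
    ll_vis, highs_vis = haar_dwt2(vis)
    ll_fused = 0.5 * (ll_ir + ll_vis)
    highs_fused = tuple(
        max_abs_select(p, q) for p, q in zip(highs_ir, highs_vis)
    )
    return haar_idwt2(ll_fused, highs_fused)


# Usage with synthetic data standing in for an infrared/visible pair.
rng = np.random.default_rng(0)
ir_img = rng.random((8, 8))
vis_img = rng.random((8, 8))
fused_img = fuse(ir_img, vis_img)
```

Because the Haar transform used here is exactly invertible, down-sampling by DWT discards no information, which is the property the abstract relies on when it describes DWT down-sampling as having minimal information loss.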
Pages: 15