WaveFusionNet: Infrared and visible image fusion based on multi-scale feature encoder-decoder and discrete wavelet decomposition

被引:0
|
作者
Liu, Renhe [1 ]
Liu, Yu [1 ]
Wang, Han [1 ]
Du, Shan [2 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[2] Univ British Columbia, Dept Comp Sci Math Phys & Stat, Okanagan Campus, Kelowna, BC V1V 1V7, Canada
关键词
Infrared and visible image fusion; Frequency feature decomposition; Discrete wavelet transform; Multi-scale encoder; Dual-band feature fusion; QUALITY ASSESSMENT; TRANSFORM; FRAMEWORK; NETWORK; NEST;
D O I
10.1016/j.optcom.2024.131024
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
To merge complementary information from multimodal images, such as thermal saliency from infrared images and texture details from visible images, traditional multi-scale transform-based methods have been extensively studied, with deep learning-based methods gaining significant popularity in recent years. However, there has been limited research on optimally combining the advantages of these two categories in fusion. In this paper, we propose a novel infrared and visible image fusion (IVIF) framework, WaveFusionNet, which integrates precise frequency feature decomposition from the discrete wavelet transform (DWT) with the comprehensive feature extraction from the multi-scale encoder. Firstly, we train an encoder-decoder network for multi- scale feature extraction and image reconstruction. DWT is used for down-sampling with minimal information loss by decomposing extracted features into low and high-frequency sub-bands. Next, a dual-band feature fusion (DBFF) module is trained to merge these sub-bands by integrating a spatial feature transform-based sub-network for low-frequency fusion and a maximum absolute value selection strategy for fusing high- frequencies. Finally, all fused sub-bands are fed into the pre-trained decoder to reconstruct the final image. Experimental results on three benchmark datasets (TNO, Roadscene, and MSRS) demonstrate that the proposed fusion method outperforms recent IVIF methods in both quantitative assessment and visual perception while maintaining competitive time complexity.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks
    Sun, Mengyuan
    Guo, Runhua
    Zhu, Jinhui
    Fan, Wenhui
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 869 - 874
  • [42] Multi-Scale Attention and Encoder-Decoder Network for Video Saliency Object Detection
    Bi, Hongbo
    Zhu, Huihui
    Yang, Lina
    Wu, Ranwan
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (02) : 340 - 350
  • [43] Infrared and visible image fusion with the use of multi-scale edge-preserving decomposition and guided image filter
    Gan, Wei
    Wu, Xiaohong
    Wu, Wei
    Yang, Xiaomin
    Ren, Chao
    He, Xiaohai
    Liu, Kai
    INFRARED PHYSICS & TECHNOLOGY, 2015, 72 : 37 - 51
  • [44] An infrared and visible image fusion method based on multi-scale transformation and norm optimization
    Li, Guofa
    Lin, Yongjie
    Qu, Xingda
    INFORMATION FUSION, 2021, 71 : 109 - 129
  • [45] Infrared and visible image fusion based on multi-scale dense attention connection network
    Chen Y.
    Zhang J.
    Wang Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
  • [46] Infrared and visible image fusion enhancement technology based on multi-scale directional analysis
    Zhou Xin
    Liu Rui-an
    Chen Fin
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4035 - 4037
  • [47] A multi-scale information integration framework for infrared and visible image fusion
    Yang, Guang
    Li, Jie
    Lei, Hanxiao
    Gao, Xinbo
    NEUROCOMPUTING, 2024, 600
  • [48] Infrared and visible image fusion for ship targets based on scale-aware feature decomposition
    Zheng, Xin
    Kang, Di
    Si, Pengbo
    Wu, Qiang
    IET IMAGE PROCESSING, 2022, 16 (14) : 3977 - 3987
  • [49] Infrared and visible image fusion using multi-scale pyramid network
    Zuo, Fengyuan
    Huang, Yongdong
    Li, Qiufu
    Su, Weijian
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2022, 20 (05)
  • [50] Infrared image enhancement through saliency feature analysis based on multi-scale decomposition
    Zhao, Jufeng
    Chen, Yueting
    Feng, Huajun
    Xu, Zhihai
    Li, Qi
    INFRARED PHYSICS & TECHNOLOGY, 2014, 62 : 86 - 93