Co-Enhancement of Multi-Modality Image Fusion and Object Detection via Feature Adaptation

Cited by: 0
Authors
Dong, Aimei [1, 2, 3]
Wang, Long [2]
Liu, Jian [2]
Xu, Jingyuan [2]
Zhao, Guixin [1, 2, 3]
Zhai, Yi [1, 2, 3]
Lv, Guohua [1, 2, 3]
Cheng, Jinyong [1, 2, 3]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Minist Educ, Key Lab Comp, Jinan 250316, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250316, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250100, Peoples R China
Keywords
Image fusion; Task analysis; Semantics; Feature extraction; Object detection; Visualization; Visual perception; object detection; feature adaptation; mutual promotion; MULTISCALE TRANSFORM; NETWORK; NEST;
DOI
10.1109/TCSVT.2024.3433555
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
Integrating multi-modality images significantly enhances the clarity of critical details for object detection, while valuable semantic information from object detection enriches the fusion process. However, despite some semantic-driven fusion methods that cater to specific application needs, the reciprocal relationship through which the two tasks could enhance each other remains largely unexplored and underutilized. To address these limitations, this study proposes a mutually reinforcing, dual-task-driven fusion architecture. Specifically, our design integrates a feature-adaptive interlinking module into both the image fusion and object detection components, effectively managing the inherent feature discrepancies between them. The core idea is to channel distinct features from both tasks into a unified feature space after feature transformation. We then design a feature-adaptive selection module to generate features that are rich in target semantic information and compatible with the fusion network. Finally, the two tasks are effectively combined and mutually enhanced through an alternating training process. Evaluations across various datasets corroborate the effectiveness of our framework, showing visible improvements in both fusion quality and detection accuracy.
Pages: 12624-12637
Number of pages: 14