Co-Enhancement of Multi-Modality Image Fusion and Object Detection via Feature Adaptation

被引:0
|
作者
Dong, Aimei [1 ,2 ,3 ]
Wang, Long [2 ]
Liu, Jian [2 ]
Xu, Jingyuan [2 ]
Zhao, Guixin [1 ,2 ,3 ]
Zhai, Yi [1 ,2 ,3 ]
Lv, Guohua [1 ,2 ,3 ]
Cheng, Jinyong [1 ,2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Minist Educ,Key Lab Comp, Jinan 250316, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Fac Comp Sci & Technol, Jinan 250316, Peoples R China
[3] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250100, Peoples R China
关键词
Image fusion; Task analysis; Semantics; Feature extraction; Object detection; Visualization; Visual perception; object detection; feature adaptation; mutual promotion; MULTISCALE TRANSFORM; NETWORK; NEST;
D O I
10.1109/TCSVT.2024.3433555
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The integration of multi-modality images significantly enhances the clarity of critical details for object detection. Valuable semantic data from object detection enriches the fusion process of these images. However, the potential reciprocal relationship that could enhance their mutual performance remains largely unexplored and underutilized, despite some semantic-driven fusion methodologies catering to specific application needs. To address these limitations, this study proposes a mutually reinforcing, dual-task-driven fusion architecture. Specifically, our design integrates a feature-adaptive interlinking module into both image fusion and object detection components, effectively managing the inherent feature discrepancies. The core idea is to channel distinct features from both tasks into a unified feature space after feature transformation. We then design a feature-adaptive selection module to generate features rich in target semantic information and compatible with the fusion network. Finally, effective combination and mutual enhancement of the two tasks are achieved through an alternating training process. A diverse range of swift evaluations is performed across various datasets to corroborate the potential efficiency of our framework, actualizing visible advancements in both fusion effectiveness and detection accuracy.
引用
收藏
页码:12624 / 12637
页数:14
相关论文
共 50 条
  • [21] Multi-Modality Image Fusion Using the Nonsubsampled Contourlet Transform
    Liu, Cuiyin
    Chen, Shu-qing
    Fu, Qiao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (10): : 2215 - 2223
  • [22] The application of wavelet transform to multi-modality medical image fusion
    Wang, Anna
    Sun, Haijing
    Guan, Yueyang
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, 2006, : 270 - 274
  • [23] Multi-modality gaze-contingent displays for image fusion
    Nikolov, SG
    Bull, DR
    Canagarajah, CN
    Jones, MG
    Gilchrist, ID
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II, 2002, : 1213 - 1220
  • [24] Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection
    Roy, Debashri
    Li, Yuanyuan
    Jian, Tong
    Tian, Peng
    Chowdhury, Kaushik
    Ioannidis, Stratis
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2280 - 2295
  • [25] CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion
    Zhao, Zixiang
    Bai, Haowen
    Zhang, Jiangshe
    Zhang, Yulun
    Xu, Shuang
    Lin, Zudi
    Timofte, Radu
    Van Gool, Luc
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5906 - 5916
  • [26] Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion
    Liu, Risheng
    Liu, Zhu
    Liu, Jinyuan
    Fan, Xin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1600 - 1608
  • [27] RGB-D Salient Object Detection via Feature Fusion and Multi-scale Enhancement
    Wu, Peiliang
    Duan, Liangliang
    Kong, Lingfu
    COMPUTER VISION, CCCV 2015, PT II, 2015, 547 : 359 - 368
  • [28] Multi-Modality Tensor Fusion Based Human Fatigue Detection
    Ha, Jongwoo
    Ryu, Joonhyuck
    Ko, Joonghoon
    ELECTRONICS, 2023, 12 (15)
  • [29] Multi-day multi-modality image co-registration
    Smith, E
    Barbee, D
    Pyzalski, R
    Jeraj, R
    MEDICAL PHYSICS, 2005, 32 (06) : 1894 - 1894
  • [30] AFDFusion: An adaptive frequency decoupling fusion network for multi-modality image
    Wang, Chengchao
    Zhao, Zhengpeng
    Yang, Qiuxia
    Nie, Rencan
    Cao, Jinde
    Pu, Yuanyuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263