Cross-modality complementary information fusion for multispectral pedestrian detection

被引:12
|
作者
Yan, Chaoqi [1 ]
Zhang, Hong [1 ]
Li, Xuliang [1 ]
Yang, Yifan [2 ]
Yuan, Ding [1 ]
机构
[1] Beihang Univ, Image Proc Ctr, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, 37 Xueyuan Rd, Beijing 100191, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 14期
基金
中国国家自然科学基金;
关键词
Multispectral pedestrian detection; Cross-modality; Information fusion; Illumination-aware; Feature alignment; DEEP NEURAL-NETWORKS;
D O I
10.1007/s00521-023-08239-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multispectral pedestrian detection has received increasing attention in recent years as color and thermal modalities can provide complementary visual information, especially under insufficient illumination conditions. However, there is still a persistent crucial problem that how to design the cross-modality fusion mechanism to fully exploit the complementary characteristics between different modalities. In this paper, we propose a novel cross-modality complementary information fusion network (denoted as CCIFNet) to comprehensively capture the long-range interactions with precise positional information and meanwhile preserve the inter-spatial relationship between different modalities in the feature extraction stage. Further, we design an adaptive illumination-aware weight generation module to adaptively weight the final detection confidence of color and thermal modalities by taking various illumination conditions into consideration. Specifically, we comprehensively compare three different fusion strategies about this module to synthetically explore the best way for generating the final illumination-aware fusion weights. Finally, we present a simple but effective feature alignment module to alleviate the position shift problem caused by the weakly aligned color-thermal image pairs. Extensive experiments and ablation studies on KAIST, CVC-14, FLIR and LLVIP multispectral object detection datasets show that the proposed CCIFNet can achieve state-of-the-art performance under different illumination evaluation settings, while keeping a competitive speed-accuracy trade-off for real-time applications.
引用
收藏
页码:10361 / 10386
页数:26
相关论文
共 50 条
  • [11] Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery
    Fang Qingyun
    Wang Zhaokui
    PATTERN RECOGNITION, 2022, 130
  • [12] Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery
    Qingyun, Fang
    Zhaokui, Wang
    Pattern Recognition, 2022, 130
  • [13] Multimodal Pedestrian Detection Based on Cross-Modality Reference Search
    Lee, Wei-Yu
    Jovanov, Ljubomir
    Philips, Wilfried
    IEEE SENSORS JOURNAL, 2024, 24 (10) : 17291 - 17306
  • [14] A MULTISPECTRAL-INFRARED OBJECT DETECTION METHOD BASED ON CROSS-MODALITY IMAGE FEATURE FILTERING FUSION
    Liu, Ze
    Su, Nan
    Zhao, Chunhui
    Yan, Yiming
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6823 - 6825
  • [15] Efficient cross-modality feature interaction for multispectral armored vehicle detection
    Zhang, Jie
    Chang, Tian-qing
    Zhao, Li-yang
    Ma, Jin-dun
    Han, Bin
    Zhang, Lei
    APPLIED SOFT COMPUTING, 2024, 163
  • [16] Cross-modality collaborative learning identified pedestrian
    Wen, Xiongjun
    Feng, Xin
    Li, Ping
    Chen, Wenfang
    VISUAL COMPUTER, 2023, 39 (09): : 4117 - 4132
  • [17] Cross-modality collaborative learning identified pedestrian
    Xiongjun Wen
    Xin Feng
    Ping Li
    Wenfang Chen
    The Visual Computer, 2023, 39 : 4117 - 4132
  • [18] An Efficient Cross-Modality Self-Calibrated Network for Hyperspectral and Multispectral Image Fusion
    Wu, Huapeng
    Gui, Jie
    Xu, Yang
    Wu, Zebin
    Tang, Yuan Yan
    Wei, Zhihui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [19] IMPROVING RGB-INFRARED PEDESTRIAN DETECTION BY REDUCING CROSS-MODALITY REDUNDANCY
    Wang, Qingwang
    Chi, Yongke
    Shen, Tao
    Song, Jian
    Zhang, Zifeng
    Zhu, Yan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 526 - 530
  • [20] Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization
    MA Shuo
    GAO Yongbin
    TIAN Fangzheng
    LU Junxin
    HUANG Bo
    GU Jia
    ZHOU Yilong
    WuhanUniversityJournalofNaturalSciences, 2021, 26 (02) : 128 - 136