A feature aggregation network for multispectral pedestrian detection

被引:0
|
作者
Yan Gong
Lu Wang
Lisheng Xu
机构
[1] Northeastern University,School of Computer Science and Engineering
[2] Northeastern University,College of Medicine and Biomedical Information Engineering
来源
Applied Intelligence | 2023年 / 53卷
关键词
Multispectral pedestrian detection; Feature aggregation; Saliency map; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
Pedestrian detection is an important task in many computer vision applications. Since multispectral pedestrian detection can alleviate the difficulties of insufficient illumination at night, it has been rapidly developed in recent years. However, the way for effective color-thermal image fusion still needs further research. In this paper, we propose a Feature Aggregation Module (FAM) that can adaptively capture the cross-channel and cross-dimension information interaction of the two modalities. In addition, we develop a Feature Aggregation Network (FANet) that embeds the proposed FAM module into a two-stream network adapted from the YOLOv5. FANet has the advantages that its size is small (15 MB) and it runs fast (8 ms per frame). Extensive experiments on the KAIST dataset show that the proposed method is effective for multispectral pedestrian detection, especially in the night-time condition, for which the Miss Rate is only 8.91%. Moreover, we show that the saliency map computed from the thermal image can be incorporated into FANet to further improve the detection accuracy. In order to verify the generalization ability of the FAM module, we have also conducted experiments on the person re-identification datasets, namely Market1501 and Duke. The performance of our FAM compares favorably against existing feature fusion mechanisms on the two datasets.
引用
收藏
页码:22117 / 22131
页数:14
相关论文
共 50 条
  • [41] Pedestrian Detection Using Regional Proposal Network with Feature Fusion
    Lv, Xiaogang
    Zhang, Xiaotao
    Jiang, Yinghua
    Zhang, Jianxin
    2018 EIGHTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2018, : 108 - 112
  • [42] MCANet: Multiscale Cross-Modality Attention Network for Multispectral Pedestrian Detection
    Wang, Xiaotian
    Zhao, Letian
    Wu, Wei
    Jin, Xi
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 41 - 53
  • [43] Two-stream small-scale pedestrian detection network with feature aggregation for drone-view videos
    Han Xie
    Hyunchul Shin
    Multidimensional Systems and Signal Processing, 2021, 32 : 897 - 913
  • [44] Two-stream small-scale pedestrian detection network with feature aggregation for drone-view videos
    Xie, Han
    Shin, Hyunchul
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2021, 32 (03) : 897 - 913
  • [45] Integrated Feature Pyramid Network With Feature Aggregation for Traffic Sign Detection
    Tang, Qing
    Cao, Ge
    Jo, Kang-Hyun
    IEEE ACCESS, 2021, 9 : 117784 - 117794
  • [46] Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection
    Yang, Yang
    Xu, Kaixiong
    Wang, Kaizheng
    FRONTIERS IN PHYSICS, 2023, 11
  • [47] MFMANet: a multispectral pedestrian detection network using multi-resolution RGB feature reuse with multi-scale FIR attentions
    Guo, Jiaren
    Zhang, Yuzhen
    Zheng, Jianyin
    Huang, Zihao
    Tao, Yanyun
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [48] Deep Feature Aggregation Network for Hyperspectral Anomaly Detection
    Cheng, Xi
    Huo, Yu
    Lin, Sheng
    Dong, Youqiang
    Zhao, Shaobo
    Zhang, Min
    Wang, Hai
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 1
  • [49] FANet: Feature aggregation network for RGBD saliency detection
    Zhou, Xiaofei
    Wen, Hongfa
    Shi, Ran
    Yin, Haibing
    Zhang, Jiyong
    Yan, Chenggang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 102
  • [50] Feature Aggregation and Propagation Network for Camouflaged Object Detection
    Zhou, Tao
    Zhou, Yi
    Gong, Chen
    Yang, Jian
    Zhang, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7036 - 7047