A feature aggregation network for multispectral pedestrian detection

被引：0

作者：

Yan Gong

Lu Wang

Lisheng Xu

机构：

[1] Northeastern University,School of Computer Science and Engineering

[2] Northeastern University,College of Medicine and Biomedical Information Engineering

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Multispectral pedestrian detection; Feature aggregation; Saliency map; Attention mechanism;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Pedestrian detection is an important task in many computer vision applications. Since multispectral pedestrian detection can alleviate the difficulties of insufficient illumination at night, it has been rapidly developed in recent years. However, the way for effective color-thermal image fusion still needs further research. In this paper, we propose a Feature Aggregation Module (FAM) that can adaptively capture the cross-channel and cross-dimension information interaction of the two modalities. In addition, we develop a Feature Aggregation Network (FANet) that embeds the proposed FAM module into a two-stream network adapted from the YOLOv5. FANet has the advantages that its size is small (15 MB) and it runs fast (8 ms per frame). Extensive experiments on the KAIST dataset show that the proposed method is effective for multispectral pedestrian detection, especially in the night-time condition, for which the Miss Rate is only 8.91%. Moreover, we show that the saliency map computed from the thermal image can be incorporated into FANet to further improve the detection accuracy. In order to verify the generalization ability of the FAM module, we have also conducted experiments on the person re-identification datasets, namely Market1501 and Duke. The performance of our FAM compares favorably against existing feature fusion mechanisms on the two datasets.

引用

页码：22117 / 22131

页数：14

共 50 条

[41] Pedestrian Detection Using Regional Proposal Network with Feature Fusion
Lv, Xiaogang
Zhang, Xiaotao
Jiang, Yinghua
Zhang, Jianxin
2018 EIGHTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2018, : 108 - 112
[42] MCANet: Multiscale Cross-Modality Attention Network for Multispectral Pedestrian Detection
Wang, Xiaotian
Zhao, Letian
Wu, Wei
Jin, Xi
MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 41 - 53
[43] Two-stream small-scale pedestrian detection network with feature aggregation for drone-view videos
Han Xie
Hyunchul Shin
Multidimensional Systems and Signal Processing, 2021, 32 : 897 - 913
[44] Two-stream small-scale pedestrian detection network with feature aggregation for drone-view videos
Xie, Han
Shin, Hyunchul
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2021, 32 (03) : 897 - 913
[45] Integrated Feature Pyramid Network With Feature Aggregation for Traffic Sign Detection
Tang, Qing
Cao, Ge
Jo, Kang-Hyun
IEEE ACCESS, 2021, 9 : 117784 - 117794
[46] Cascaded information enhancement and cross-modal attention feature fusion for multispectral pedestrian detection
Yang, Yang
Xu, Kaixiong
Wang, Kaizheng
FRONTIERS IN PHYSICS, 2023, 11
[47] MFMANet: a multispectral pedestrian detection network using multi-resolution RGB feature reuse with multi-scale FIR attentions
Guo, Jiaren
Zhang, Yuzhen
Zheng, Jianyin
Huang, Zihao
Tao, Yanyun
MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
[48] Deep Feature Aggregation Network for Hyperspectral Anomaly Detection
Cheng, Xi
Huo, Yu
Lin, Sheng
Dong, Youqiang
Zhao, Shaobo
Zhang, Min
Wang, Hai
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 1
[49] FANet: Feature aggregation network for RGBD saliency detection
Zhou, Xiaofei
Wen, Hongfa
Shi, Ran
Yin, Haibing
Zhang, Jiyong
Yan, Chenggang
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 102
[50] Feature Aggregation and Propagation Network for Camouflaged Object Detection
Zhou, Tao
Zhou, Yi
Gong, Chen
Yang, Jian
Zhang, Yu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7036 - 7047

← 1 2 3 4 5 →