Toward Generalizable Multispectral Pedestrian Detection

Cited by: 2
Authors
Chu, Fuchen [1, 2]
Cao, Jiale [1, 2]
Song, Zhanjie [1, 2]
Shao, Zhuang [3]
Pang, Yanwei [1, 2]
Li, Xuelong [4, 5]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Univ Warwick, Warwick Mfg Grp, Coventry CV4 7AL, England
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
Keywords
Multispectral pedestrian detection; generalization; cross-dataset evaluation; intra-dataset evaluation; transformer; DEEP NEURAL-NETWORKS; FUSION; TIME
DOI
10.1109/TITS.2023.3330155
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
Multispectral pedestrian detection has achieved great success in recent years and can be used in autonomous driving for intelligent transportation systems. Most existing multispectral pedestrian detection approaches are developed on the assumption that training and test data follow an identical distribution, which does not guarantee good generalization to cross-domain (unseen) data. In this paper, we aim to develop a generalizable multispectral pedestrian detector that achieves favorable performance on both intra-dataset and cross-dataset evaluation. To this end, we conduct intra-dataset and cross-dataset experiments using single-modal and multi-modal data. Through in-depth analysis, we find that, compared to visible or multi-modal data, thermal data not only has the best cross-dataset generalization but also generates high-quality proposals in both intra-dataset and cross-dataset evaluations. Inspired by this, we propose a novel thermal-first and fusion-second network (called TFNet) for multispectral pedestrian detection. In TFNet, we first employ a thermal-based proposal network to extract candidate pedestrian proposals. We then design a transformer-fusion-based head network to further classify and regress these proposals. Experiments are performed on three public datasets. The comprehensive results demonstrate the effectiveness of the proposed TFNet on both intra-dataset and cross-dataset evaluations. We hope that our simple design can promote future study of generalizable multispectral pedestrian detection.
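The thermal-first, fusion-second idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract gives no architectural details, so every name here (`thermal_proposals`, `attention_fuse`, `detect`), the threshold, the use of the thermal feature as the attention query, and the linear classifier are assumptions made purely for illustration. Stage 1 selects candidates from thermal scores alone; stage 2 fuses thermal and visible features with a single scaled dot-product attention step and scores each candidate.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def thermal_proposals(thermal_scores, threshold=0.5):
    """Stage 1 (assumed): keep regions whose thermal-only score passes a threshold."""
    return [i for i, s in enumerate(thermal_scores) if s >= threshold]


def attention_fuse(thermal_feat, visible_feat):
    """Stage 2 fusion (assumed): the thermal feature attends over both modality tokens."""
    tokens = [thermal_feat, visible_feat]
    scale = math.sqrt(len(thermal_feat))
    weights = softmax([dot(thermal_feat, t) / scale for t in tokens])
    dim = len(thermal_feat)
    return [sum(w * t[d] for w, t in zip(weights, tokens)) for d in range(dim)]


def detect(regions, weights, bias=0.0, proposal_threshold=0.5):
    """Run both stages; return (region index, pedestrian score) for each proposal."""
    kept = thermal_proposals([r["thermal_score"] for r in regions], proposal_threshold)
    results = []
    for i in kept:
        fused = attention_fuse(regions[i]["thermal_feat"], regions[i]["visible_feat"])
        logit = dot(weights, fused) + bias  # hypothetical linear classification head
        results.append((i, 1.0 / (1.0 + math.exp(-logit))))
    return results


# Hypothetical input: two candidate regions with 2-D features per modality.
example_regions = [
    {"thermal_score": 0.9, "thermal_feat": [1.0, 0.5], "visible_feat": [0.2, 0.1]},
    {"thermal_score": 0.1, "thermal_feat": [0.0, 0.1], "visible_feat": [0.9, 0.8]},
]
print(detect(example_regions, weights=[1.0, 1.0]))
```

In this sketch the second region is never scored, because the thermal-only stage rejects it before any fusion happens; that ordering is the point the abstract makes about thermal data producing the proposals.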
Pages: 3739 - 3750
Page count: 12
Related papers
50 in total
  • [31] Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection
    Xie, Jin
    Anwer, Rao Muhammad
    Cholakkal, Hisham
    Nie, Jing
    Cao, Jiale
    Laaksonen, Jorma
    Khan, Fahad Shahbaz
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4043 - 4052
  • [32] Cross-modality interactive attention network for multispectral pedestrian detection
    Zhang, Lu
    Liu, Zhiyong
    Zhang, Shifeng
    Yang, Xu
    Qiao, Hong
    Huang, Kaizhu
    Hussain, Amir
    INFORMATION FUSION, 2019, 50 : 20 - 29
  • [33] Cross-modality complementary information fusion for multispectral pedestrian detection
    Yan, Chaoqi
    Zhang, Hong
    Li, Xuliang
    Yang, Yifan
    Yuan, Ding
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10361 - 10386
  • [34] Feature Map Swap: Multispectral Data Fusion Method for Pedestrian Detection
    Ryu, Junhwan
    Kim, Sungho
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 319 - 323
  • [35] Pseudo-Multispectral Pedestrian Detection with Deep Thermal Feature Guidance
    Chu, Fuchen
    Pang, Yanwei
    Sun, Xuebin
    Cao, Jiale
    Song, Zhanjie
    GUIDANCE NAVIGATION AND CONTROL, 2024, 04 (03)
  • [36] Pseudo-Multispectral Pedestrian Detection with Deep Thermal Feature Guidance
    Fuchen Chu
    Yanwei Pang
    Xuebin Sun
    Jiale Cao
    Zhanjie Song
    Guidance, Navigation and Control, 2024, (03) : 92 - 107
  • [37] Mask-guided explicit feature modulation for multispectral pedestrian detection
    Shen, Jifeng
    Liu, Yue
    Chen, Yifei
    Zuo, Xin
    Li, Jun
    Yang, Wankou
    Computers and Electrical Engineering, 2022, 103
  • [38] Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection
    Zhang, Lu
    Zhu, Xiangyu
    Chen, Xiangyu
    Yang, Xu
    Lei, Zhen
    Liu, Zhiyong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5126 - 5136
  • [40] Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection
    Bao, Wei
    Hu, Jingjing
    Huang, Meiyu
    Xiang, Xueshuang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 499 - 510