Toward Generalizable Multispectral Pedestrian Detection

Cited by: 2
Authors
Chu, Fuchen [1, 2]
Cao, Jiale [1, 2]
Song, Zhanjie [1, 2]
Shao, Zhuang [3]
Pang, Yanwei [1, 2]
Li, Xuelong [4, 5]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Univ Warwick, Warwick Mfg Grp, Coventry CV4 7AL, England
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
Keywords
Multispectral pedestrian detection; generalization; cross-dataset evaluation; intra-dataset evaluation; transformer; DEEP NEURAL-NETWORKS; FUSION; TIME;
DOI
10.1109/TITS.2023.3330155
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
Multispectral pedestrian detection has achieved great success in recent years and can be used in autonomous driving for intelligent transportation systems. Most existing multispectral pedestrian detection approaches are developed on the assumption that training and test data follow an identical distribution, which does not guarantee good generalization to cross-domain (unseen) data. In this paper, we aim to develop a generalizable multispectral pedestrian detector that achieves favorable performance on both intra-dataset and cross-dataset evaluation. To achieve this goal, we conduct intra-dataset and cross-dataset experiments using single-modal and multi-modal data. Through in-depth analysis, we find that, compared to visible or multi-modal data, thermal data not only has the best cross-dataset generalization, but also generates high-quality proposals on both intra-dataset and cross-dataset evaluations. Inspired by this, we propose a novel thermal-first and fusion-second network (called TFNet) for multispectral pedestrian detection. In TFNet, we first employ a thermal-based proposal network to extract candidate pedestrian proposals. After that, we design a transformer-fusion-based head network to further classify and regress these proposals. Experiments are performed on three public datasets. The comprehensive results demonstrate the effectiveness of the proposed TFNet on both intra-dataset and cross-dataset evaluations. We hope that our simple design can promote future study of generalizable multispectral pedestrian detection.
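The two-stage idea in the abstract (proposals from thermal data first, multi-modal fusion second) can be illustrated with a minimal sketch. This is not the authors' TFNet implementation: the function names, the toy feature vectors, and the single-head attention used for fusion are all hypothetical stand-ins chosen only to show the order of the two stages.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), hypothetical format


@dataclass
class Proposal:
    box: Box
    thermal_score: float


def thermal_first_proposals(boxes: Sequence[Box],
                            thermal_scores: Sequence[float],
                            top_k: int) -> List[Proposal]:
    """Stage 1 (assumed): keep the top_k boxes ranked by thermal score only."""
    ranked = sorted(zip(boxes, thermal_scores), key=lambda p: p[1], reverse=True)
    return [Proposal(box, score) for box, score in ranked[:top_k]]


def softmax(xs: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a short list."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(thermal_feat: Sequence[float],
                   visible_feat: Sequence[float]) -> List[float]:
    """Stage 2 (assumed): toy single-head attention over the two modality
    tokens, with the thermal feature acting as the query."""
    tokens = [thermal_feat, visible_feat]
    scale = math.sqrt(len(thermal_feat))
    logits = [sum(q * k for q, k in zip(thermal_feat, tok)) / scale
              for tok in tokens]
    weights = softmax(logits)
    dim = len(thermal_feat)
    return [sum(w * tok[i] for w, tok in zip(weights, tokens))
            for i in range(dim)]


def classify(fused: Sequence[float],
             weights: Sequence[float],
             bias: float = 0.0) -> float:
    """Hypothetical linear head with a sigmoid: pedestrian probability."""
    z = sum(f * w for f, w in zip(fused, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))


# Toy usage: three candidate boxes, keep the two with the highest thermal score.
boxes = [(0, 0, 10, 20), (5, 5, 15, 30), (50, 50, 60, 80)]
scores = [0.2, 0.9, 0.6]
proposals = thermal_first_proposals(boxes, scores, top_k=2)
fused = attention_fuse([1.0, 0.0], [0.0, 1.0])
probability = classify(fused, weights=[0.5, -0.5])
```

In this sketch the visible modality never influences which boxes are proposed; it only enters when each surviving proposal is re-scored, which mirrors the thermal-first, fusion-second ordering described above.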
Pages: 3739-3750
Number of pages: 12
Related papers
50 in total
  • [21] Transformer fusion and histogram layer multispectral pedestrian detection network
    Ying Zang
    Chenglong Fu
    Dongsheng Yang
    Hui Li
    Chaotao Ding
    Qingshan Liu
    Signal, Image and Video Processing, 2023, 17 : 3545 - 3553
  • [22] Multispectral pedestrian detection based on deep convolutional neural networks
    Hou, Ya-Li
    Song, Yaoyao
    Hao, Xiaoli
    Shen, Yan
    Qian, Manyi
    Chen, Houjin
    INFRARED PHYSICS & TECHNOLOGY, 2018, 94 : 69 - 77
  • [23] Pedestrian Detection Using Multispectral Images and a Deep Neural Network
    Nataprawira, Jason
    Gu, Yanlei
    Goncharenko, Igor
    Kamijo, Shunsuke
    SENSORS, 2021, 21 (07)
  • [24] Multispectral Pedestrian Detection Based on Deep Convolutional Neural Networks
    Hou, Ya-Li
    Song, Yaoyao
    Tao, Xiaoli
    Shen, Yan
    Qian, Manyi
    2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2017,
  • [25] Illumination-aware Multispectral Fusion Network for Pedestrian Detection
    Peng P.
    Ren S.
    Li J.
    Zhou H.
    Xu T.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (09): 2622 - 2630
  • [26] Transformer fusion and histogram layer multispectral pedestrian detection network
    Zang, Ying
    Fu, Chenglong
    Yang, Dongsheng
    Li, Hui
    Ding, Chaotao
    Liu, Qingshan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (07) : 3545 - 3553
  • [27] HAFNet: Hierarchical Attentive Fusion Network for Multispectral Pedestrian Detection
    Peng, Peiran
    Xu, Tingfa
    Huang, Bo
    Li, Jianan
    REMOTE SENSING, 2023, 15 (08)
  • [28] Counting Canola: Toward Generalizable Aerial Plant Detection Models
    Andvaag, Erik
    Krys, Kaylie
    Shirtliffe, Steven J.
    Stavness, Ian
    PLANT PHENOMICS, 2024, 6
  • [29] Toward Robust Pedestrian Detection With Data Augmentation
    Cygert, Sebastian
    Czyzewski, Andrzej
    IEEE ACCESS, 2020, 8 (08): 136674 - 136683
  • [30] Cross-modality complementary information fusion for multispectral pedestrian detection
    Chaoqi Yan
    Hong Zhang
    Xuliang Li
    Yifan Yang
    Ding Yuan
    Neural Computing and Applications, 2023, 35 : 10361 - 10386