Toward Generalizable Multispectral Pedestrian Detection

Cited by: 2
Authors
Chu, Fuchen [1, 2]
Cao, Jiale [1, 2]
Song, Zhanjie [1, 2]
Shao, Zhuang [3]
Pang, Yanwei [1, 2]
Li, Xuelong [4, 5]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Univ Warwick, Warwick Mfg Grp, Coventry CV4 7AL, England
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
Keywords
Multispectral pedestrian detection; generalization; cross-dataset evaluation; intra-dataset evaluation; transformer; DEEP NEURAL-NETWORKS; FUSION; TIME
DOI
10.1109/TITS.2023.3330155
Chinese Library Classification number
TU [Architectural Science]
Discipline classification code
0813
Abstract
Multispectral pedestrian detection has achieved great success in recent years and can be used in autonomous driving for intelligent transportation systems. Most existing multispectral pedestrian detection approaches are developed on the assumption that training and test data follow an identical distribution, which does not guarantee good generalization to cross-domain (unseen) data. In this paper, we aim to develop a generalizable multispectral pedestrian detector that achieves favorable performance on both intra-dataset and cross-dataset evaluation. To this end, we conduct intra-dataset and cross-dataset experiments using single-modal and multi-modal data. Through in-depth analysis, we find that, compared to visible or multi-modal data, thermal data not only has the best cross-dataset generalization but also generates high-quality proposals on both intra-dataset and cross-dataset evaluations. Inspired by this, we propose a novel thermal-first and fusion-second network (called TFNet) for multispectral pedestrian detection. In TFNet, we first employ a thermal-based proposal network to extract candidate pedestrian proposals. We then design a transformer-fusion-based head network to further classify and regress these proposals. Experiments are performed on three public datasets. The comprehensive results demonstrate the effectiveness of the proposed TFNet on both intra-dataset and cross-dataset evaluations. We hope that our simple design can promote future study on generalizable multispectral pedestrian detection.
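The thermal-first, fusion-second ordering described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`thermal_proposals`, `attention_fuse`, `detect`), the score threshold, the single-head dot-product attention over two modality tokens, and the linear classifier are all assumptions made for illustration. The actual TFNet uses a learned proposal network and a transformer head that also regresses boxes, which this sketch omits.

```python
import math

def thermal_proposals(thermal_scores, threshold=0.5):
    """Stage 1 (thermal-first): keep candidates whose thermal-only score passes a threshold.
    The threshold is a hypothetical stand-in for a learned proposal network."""
    return [i for i, s in enumerate(thermal_scores) if s >= threshold]

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention_fuse(thermal_feat, visible_feat):
    """Stage 2 (fusion-second): toy single-head dot-product attention.
    The thermal feature is the query; both modality features act as keys and values."""
    tokens = [thermal_feat, visible_feat]
    scale = math.sqrt(len(thermal_feat))
    logits = [sum(q * k for q, k in zip(thermal_feat, tok)) / scale for tok in tokens]
    weights = softmax(logits)
    dim = len(thermal_feat)
    return [sum(w * tok[d] for w, tok in zip(weights, tokens)) for d in range(dim)]

def detect(thermal_scores, thermal_feats, visible_feats, classifier_w, threshold=0.5):
    """Run proposals on thermal data only, then classify each proposal from fused features."""
    results = []
    for i in thermal_proposals(thermal_scores, threshold):
        fused = attention_fuse(thermal_feats[i], visible_feats[i])
        logit = sum(w * f for w, f in zip(classifier_w, fused))
        results.append((i, 1.0 / (1.0 + math.exp(-logit))))  # sigmoid confidence
    return results
```

The point of the ordering is that the visible stream never decides which candidates exist; it only refines the classification of candidates that the thermal stream has already proposed.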
Pages: 3739-3750
Number of pages: 12