Toward Generalizable Multispectral Pedestrian Detection

被引:2
|
作者
Chu, Fuchen [1 ,2 ]
Cao, Jiale [1 ,2 ]
Song, Zhanjie [1 ,2 ]
Shao, Zhuang [3 ]
Pang, Yanwei [1 ,2 ]
Li, Xuelong [4 ,5 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Univ Warwick, Warwick Mfg Grp, Coventry CV4 7AL, England
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
关键词
Multispectral pedestrian detection; generalization; cross-dataset evaluation; intra-dataset evaluation; transformer; DEEP NEURAL-NETWORKS; FUSION; TIME;
D O I
10.1109/TITS.2023.3330155
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Multispectral pedestrian detection has achieved great success in past years, which can be used in autonomous driving for intelligent transportation system. Most existing multispectral pedestrian detection approaches are developed on the assumption that training and test data belong to an identical distribution, which does not guarantee a good generalization to cross-domain (unseen) data. In this paper, we aim to develop a generalizable multispectral pedestrian detector, which achieves a favorable performance on both intra-dataset evaluation and cross-dataset evaluation. To achieve this goal, we conduct intra-dataset and cross-dataset experiments using single-modal and multi-modal data. By deep analysis, we find that, compared to visible or multi-modal data, thermal data not only has a best cross-dataset generalization, but also generates high-quality proposals on intra-dataset and cross-dataset evaluations. Inspired by this, we propose a novel thermal-first and fusion-second network (called TFNet) for multispectral pedestrian detection. In our TFNet, we first employ a thermal-based proposal network to extract candidate pedestrian proposals. After that, we design a transformer fusion based head network to further classify/regress these proposals. Experiments are performed on three public datasets. The comprehensive results demonstrate the effectiveness of our proposed TFNet on both intra-dataset and cross-dataset evaluations. We hope that our simple design can promote the future study on generalizable multispectral pedestrian detection.
引用
收藏
页码:3739 / 3750
页数:12
相关论文
共 50 条
  • [1] Generalizable Pedestrian Detection: The Elephant In The Room
    Hasan, Irtiza
    Liao, Shengcai
    Li, Jinpeng
    Akram, Saad Ullah
    Shao, Ling
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11323 - 11332
  • [2] Multispectral pedestrian detection in autonomous driving: A review
    Li, Yuting
    IEIE Transactions on Smart Processing and Computing, 2021, 10 (01): : 10 - 16
  • [3] Multispectral Pedestrian Detection: Benchmark Dataset and Baseline
    Hwang, Soonmin
    Park, Jaesik
    Kim, Namil
    Choi, Yukyung
    Kweon, In So
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1037 - 1045
  • [4] AN ALGORITHM FOR PEDESTRIAN DETECTION IN MULTISPECTRAL IMAGE SEQUENCES
    Kniaz, V. V.
    Fedorenko, V. V.
    INTERNATIONAL WORKSHOP PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2017, 42-2 (W4): : 73 - 77
  • [5] Unsupervised Domain Adaptation for Multispectral Pedestrian Detection
    Guan, Dayan
    Luo, Xing
    Cao, Yanpeng
    Yang, Jiangxin
    Cao, Yanlong
    Vosselman, George
    Yang, Michael Ying
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 434 - 443
  • [6] Attentive Alignment Network for Multispectral Pedestrian Detection
    Chen, Nuo
    Xie, Jin
    Nie, Jing
    Cao, Jiale
    Shao, Zhuang
    Pang, Yanwei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3787 - 3795
  • [7] A feature aggregation network for multispectral pedestrian detection
    Gong, Yan
    Wang, Lu
    Xu, Lisheng
    APPLIED INTELLIGENCE, 2023, 53 (19) : 22117 - 22131
  • [8] A feature aggregation network for multispectral pedestrian detection
    Yan Gong
    Lu Wang
    Lisheng Xu
    Applied Intelligence, 2023, 53 : 22117 - 22131
  • [9] Convolutional neural networks for multispectral pedestrian detection
    Ding, Lu
    Wang, Yong
    Laganiere, Robert
    Huang, Dan
    Fu, Shan
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 82
  • [10] Toward unlabeled multi-view 3D pedestrian detection by generalizable AI: techniques and performance analysis
    Lima, Joao Paulo
    Thomas, Diego
    Uchiyama, Hideaki
    Teichrieb, Veronica
    2023 36TH CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES, SIBGRAPI 2023, 2023, : 121 - 126