MULTI-SCALE DEFORMABLE TRANSFORMER ENCODER BASED SINGLE-STAGE PEDESTRIAN DETECTION

被引:1
|
作者
Yuan, Jing [1 ]
Barmpoutis, Panagiotis [2 ]
Stathaki, Tania [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London, England
[2] UCL, Dept Comp Sci, London, England
关键词
Pedestrian detection; single-stage method; vision transformer; NETWORK;
D O I
10.1109/ICIP46576.2022.9897361
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection is a key task in intelligent video surveillance systems which requires both fast inference and high detection accuracy. Although single-stage deep learning pedestrian detectors have achieved relatively high detection accuracy with simpler architecture and less inference time, their performance is limited compared to two-stage methods. The reason is the lack of scale-aware features without the assistance of proposal regions. To overcome this, a multiscale deformable transformer encoder-based module is proposed. It can extract the sparse important features at deformable sampling locations from multiple levels. The proposed architecture significantly improves the performance compared to the baseline center and scale prediction method on both Caltech and Citypersons datasets. It even outperforms the state-of-the-art two-stage methods in detecting heavily occluded pedestrians on Citypersons validation set.
引用
收藏
页码:2906 / 2910
页数:5
相关论文
共 50 条
  • [1] Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention
    Zhang Guoli
    Chang Shuai
    Song Yansong
    Liu Tianci
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (10)
  • [2] Effectiveness of Vision Transformer for Fast and Accurate Single-Stage Pedestrian Detection
    Yuan, Jing
    Barmpoutis, Panagiotis
    Stathaki, Tania
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Efficient Single-Stage Pedestrian Detector by Asymptotic Localization Fitting and Multi-Scale Context Encoding
    Liu, Wei
    Liao, Shengcai
    Hu, Weidong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1413 - 1425
  • [4] Multi-scale single-stage pose detection with adaptive sample training in the classroom scene
    Gao, Chenqiang
    Ye, Sheng
    Tian, Hang
    Yan, Yan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [5] Pedestrian Detection Based on Multi-Scale Fusion Features
    Jiang, Hao
    Zhang, Chuang
    Wu, Ming
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 329 - 333
  • [6] A pedestrian detection algorithm based on sparse multi-scale image segmentation and cascade deformable part model
    Lv R.
    Shao Z.
    [J]. Shao, Zhenfeng (shaozhenfeng@whu.edu.cn), 1600, Editorial Board of Medical Journal of Wuhan University (41): : 1544 - 1549
  • [7] FPGA Implementation of HOG based Multi-Scale Pedestrian Detection
    Wang, Ming-Shi
    Zhang, Zhe-Rong
    [J]. PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 1099 - 1102
  • [8] Pedestrian Re-Identification Based on CNN and TransFormer Multi-scale Learning
    Chen, Ying
    Kuang, Cheng
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 2256 - 2263
  • [9] ICDT: Maintaining Interaction Consistency for Deformable Transformer with Multi-scale Features in HOI Detection
    Guo, Bingnan
    Liu, Sheng
    Zhang, Feng
    Chen, Junhao
    Chen, Ruixiang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 433 - 445
  • [10] Multi-Scale Deformable Transformer for Banknote Serial Number Recognition
    Zhang, Kaisheng
    Li, Xuyang
    [J]. Computer Engineering and Applications, 2023, 59 (18) : 105 - 118