SW-YOLOX: A YOLOX-based real-time pedestrian detector with shift window-mixed attention mechanism

被引:2
|
作者
Tsai, Chi-Yi [1 ]
Wang, Run-Yu [1 ]
Chiu, Yu-Chen [1 ]
机构
[1] Tamkang Univ, Dept Elect & Comp Engn, 151 Yingzhuan Rd, New Taipei City 251, Taiwan
关键词
Deep learning; Pedestrian detection; Attention mechanism; Feature Pyramid Network; EFFICIENT; NETWORK;
D O I
10.1016/j.neucom.2024.128357
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection is a critical research area in computer vision with practical applications. This paper addresses this key topic by providing a novel lightweight model named Shift Window-YOLOX (SW-YOLOX). The purpose of SW-YOLOX is to significantly enhance the robustness and real-time performance of pedestrian detection under practical application requirements. The proposed method incorporates a novel Shift Window- Mixed Attention Mechanism (SW-MAM), which combines spatial and channel attention for effective feature extraction. In addition, we introduce a novel up-sampling layer, PatchExpandingv2, to enhance spatial feature representation while maintaining computational efficiency. Furthermore, we propose a novel Shift Window-Path Aggregation Feature Pyramid Network (SW-PAFPN) to integrate with the YOLOX detector, further enhancing feature extraction and the robustness of pedestrian detection. Experimental results validated on challenging datasets such as CrowdHuman, MOT17Det, and MOT20Det demonstrate the competitive performance of the proposed SW-YOLOX compared to state-of-the-art methods and its pedestrian detection performance in crowded and complex scenes.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A real-time fire detection method from video for electric vehicle-charging stations based on improved YOLOX-tiny
    Yifan Ju
    Dexin Gao
    Shiyu Zhang
    Qing Yang
    Journal of Real-Time Image Processing, 2023, 20
  • [22] Real-Time Ultrasound Image Despeckling Using Mixed-Attention Mechanism Based Residual UNet
    Lan, Yancheng
    Zhang, Xuming
    IEEE ACCESS, 2020, 8 : 195327 - 195340
  • [23] Real-time robust visual tracking based on spatial attention mechanism
    Ma S.
    Zhang Z.
    Pu L.
    Hou Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02): : 419 - 432
  • [24] A HOG-based Real-time and Multi-scale Pedestrian Detector Demonstration System on FPGA
    Duerre, Jan
    Paradzik, Dario
    Blume, Holger
    PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 163 - 172
  • [25] Real-time topology optimization based on multi-scale convolutional attention mechanism
    Zhang, Wei
    Su, Lijie
    Wang, Xianpeng
    ENGINEERING OPTIMIZATION, 2024,
  • [26] Real-Time Monitoring for Hydraulic States Based on Convolutional Bidirectional LSTM with Attention Mechanism
    Kim, Kyutae
    Jeong, Jongpil
    SENSORS, 2020, 20 (24) : 1 - 17
  • [27] Real-Time Imputation Model for Missing Sensor Data Based on Alternating Attention Mechanism
    Zhang, Mingxian
    Zhao, Ran
    Wang, Cong
    Jing, Ling
    Li, Daoliang
    IEEE SENSORS JOURNAL, 2025, 25 (05) : 8962 - 8974
  • [28] Research on Real-Time Face Key Point Detection Algorithm Based on Attention Mechanism
    Gao, Jiangjin
    Yang, Tao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [29] A Real-time Detection Algorithm Based on Nanodet for Pavement Cracks by Incorporating Attention Mechanism
    Yong, Pengfei
    Li, Suoling
    Wang, Kun
    Zhu, Yupeng
    2022 8TH INTERNATIONAL CONFERENCE ON HYDRAULIC AND CIVIL ENGINEERING: DEEP SPACE INTELLIGENT DEVELOPMENT AND UTILIZATION FORUM, ICHCE, 2022, : 1245 - 1250
  • [30] Diversified real-time user interest recommendation based on self-attention mechanism
    Hu C.
    Chen C.
    Chen T.
    Miao H.
    Chen W.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2023, 43 (09): : 2579 - 2594