Multi-scale pedestrian intent prediction using 3D joint information as spatio-temporal representation

被引:9
|
作者
Ahmed, Sarfraz [1 ]
Al Bazi, Ammar [2 ]
Saha, Chitta [1 ]
Rajbhandari, Sujan [3 ]
Huda, M. Nazmul [4 ]
机构
[1] Coventry Univ, Sch Future Transport Engn, Priory St, Coventry CV1 5FB, W Midlands, England
[2] Coventry Univ, Sch Mech Engn, Priory St, Coventry CV1 5FB, W Midlands, England
[3] Bangor Univ, DSP Ctr Excellence, Sch Comp Sci & Elect Engn, Bangor LL57 2DG, Gwynedd, Wales
[4] Brunel Univ London, Dept Elect & Elect Engn, Kingston Lane, London UB8 3PH, England
关键词
LSTM; Intent prediction; Pose estimation; Tracking; Pedestrian detection; TRAJECTORY PREDICTION; MODEL;
D O I
10.1016/j.eswa.2023.120077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a rise of use of Autonomous Vehicles on public roads. With the predicted rise of road traffic accidents over the coming years, these vehicles must be capable of safely operate in the public domain. The field of pedestrian detection has significantly advanced in the last decade, providing high-level accuracy, with some technique reaching near-human level accuracy. However, there remains further work required for pedestrian intent prediction to reach human-level performance. One of the challenges facing current pedestrian intent predictors are the varying scales of pedestrians, particularly smaller pedestrians. This is because smaller pedestrians can blend into the background, making them difficult to detect, track or apply pose estimations techniques. Therefore, in this work, we present a novel intent prediction approach for multi-scale pedestrians using 2D pose estimation and a Long Short-term memory (LSTM) architecture. The pose estimator predicts keypoints for the pedestrian along the video frames. Based on the accumulation of these keypoints along the frames, spatio-temporal data is generated. This spatio-temporal data is fed to the LSTM for classifying the crossing behaviour of the pedestrians. We evaluate the performance of the proposed techniques on the popular Joint Attention in Autonomous Driving (JAAD) dataset and the new larger-scale Pedestrian Intention Estimation (PIE) dataset. Using data generalisation techniques, we show that the proposed technique outperformed the state-of-the-art techniques by up to 7%, reaching up to 94% accuracy while maintaining a comparable run-time of 6.1 ms.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Dynamic Spatio-Temporal Multi-Scale Representation for Bus Ridership Prediction
    Peng, Lilan
    Wang, Xiu
    Lu, Hongchun
    Guo, Xiangyu
    Li, Tianrui
    Ji, Shenggong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Spatio-temporal Multi-scale Pedestrian Flow Model by using Attention Module
    Sakurai, Akihiro
    Yamamoto, Ko
    2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, : 1842 - 1847
  • [3] MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement
    Lin, Xin
    Chen, Junli
    Ai, Shaojie
    Liu, Jing
    Li, Bochao
    Li, Qingying
    Ma, Rui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [4] Local Spatio-Temporal Representation Using the 3D Shearlet Transform
    Damiano Malafronte
    Ernesto De Vito
    Francesca Odone
    Sampling Theory in Signal and Image Processing, 2018, 17 (1): : 57 - 72
  • [5] Local Spatio-Temporal Representation using the 3D Shearlet Transform
    Malafronte, Damiano
    Odone, Francesca
    De Vito, Ernesto
    2017 INTERNATIONAL CONFERENCE ON SAMPLING THEORY AND APPLICATIONS (SAMPTA), 2017, : 585 - 589
  • [6] A Spatio-Temporal 3D Representation of a Historic Dataset
    Papasarantou, Chrissa
    Kalaouzis, Giorgos
    Pentazou, Ioulia
    Bourdakis, Vassilis
    ECAADE 2015: REAL TIME - EXTENDING THE REACH OF COMPUTATION, VOL 1, 2015, : 701 - 708
  • [7] Multi-scale Spatio-temporal Attention Network for Traffic Flow Prediction
    Li, Minghao
    Li, Jinhong
    Ta, Xuxiang
    Bai, Yanbo
    Hao, Xinzhe
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 294 - 305
  • [8] Cell Traffic Prediction Using Joint Spatio-Temporal Information
    Lovisotto, Enrico
    Vianello, Enrico
    Cazzaro, Davide
    Polese, Michele
    Chiariotti, Federico
    Zucchetto, Daniel
    Zanella, Andrea
    Zorzi, Michele
    2017 6TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2017,
  • [9] A Systematic Review of Multi-Scale Spatio-Temporal Crime Prediction Methods
    Du, Yingjie
    Ding, Ning
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (06)
  • [10] MSSTN: a multi-scale spatio-temporal network for traffic flow prediction
    Song, Yun
    Bai, Xinke
    Fan, Wendong
    Deng, Zelin
    Jiang, Cong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2827 - 2841