M2Tames: Interaction and Semantic Context Enhanced Pedestrian Trajectory Prediction

被引:0
|
作者
Gao, Xu [1 ,2 ]
Wang, Yanan [1 ,2 ]
Zhao, Yaqian [1 ,2 ]
Li, Yilong [3 ]
Wu, Gang [1 ,2 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Natl Supercomp Ctr Zhengzhou, Zhengzhou 450001, Peoples R China
[3] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 18期
关键词
trajectory prediction; attention mechanism; autonomous driving; deep learning;
D O I
10.3390/app14188497
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Autonomous driving pays considerable attention to pedestrian trajectory prediction as a crucial task. Constructing effective pedestrian trajectory prediction models depends heavily on utilizing the motion characteristics of pedestrians, along with their interactions among themselves and between themselves and their environment. However, traditional trajectory prediction models often fall short of capturing complex real-world scenarios. To address these challenges, this paper proposes an enhanced pedestrian trajectory prediction model, M(2)Tames, which incorporates comprehensive motion, interaction, and semantic context factors. M(2)Tames provides an interaction module (IM), which consists of an improved multi-head mask temporal attention mechanism (M(2)Tea) and an Interaction Inference Module (I-2). M(2)Tea thoroughly characterizes the historical trajectories and potential interactions, while I-2 determines the precise interaction types. Then, IM adaptively aggregates useful neighbor features to generate a more accurate interactive feature map and feeds it into the final layer of the U-Net encoder to fuse with the encoder's output. Furthermore, by adopting the U-Net architecture, M(2)Tames can learn and interpret scene semantic information, enhancing its understanding of the spatial relationships between pedestrians and their surroundings. These innovations improve the accuracy and adaptability of the model for predicting pedestrian trajectories. Finally, M(2)Tames is evaluated on the ETH/UCY and SDD datasets for short- and long-term settings, respectively. The results demonstrate that M(2)Tames outperforms the state-of-the-art model MSRL by 2.49% (ADE) and 8.77% (FDE) in the short-term setting and surpasses the optimum Y-Net by 6.89% (ADE) and 1.12% (FDE) in the long-term prediction. Excellent performance is also shown on the ETH/UCY datasets.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Multi-level context-driven interaction modeling for human future trajectory prediction
    Zhiquan He
    Hao Sun
    Wenming Cao
    Henry Z. He
    Neural Computing and Applications, 2022, 34 : 20101 - 20115
  • [42] Multi-level context-driven interaction modeling for human future trajectory prediction
    He, Zhiquan
    Sun, Hao
    Cao, Wenming
    He, Henry Z.
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 20101 - 20115
  • [43] M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction
    Sun, Qiao
    Huang, Xin
    Gu, Junru
    Williams, Brian C.
    Zhao, Hang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6533 - 6542
  • [44] Ballistic Trajectory Prediction Based on Context-enhanced Long Short-Term Memory Network
    Ren J.
    Wu X.
    Bo Y.
    Wu P.
    He S.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (02): : 462 - 471
  • [45] Correction to: Multi-level context-driven interaction modeling for human future trajectory prediction
    Zhiquan He
    Hao Sun
    Wenming Cao
    Henry Z. He
    Neural Computing and Applications, 2023, 35 : 20441 - 20441
  • [46] Hierarchical Multi-Supervision Multi-Interaction Graph Attention Network for Multi-Camera Pedestrian Trajectory Prediction
    Zhao, Guoliang
    Zhou, Yuxun
    Xu, Zhanbo
    Zhou, Yadong
    Wu, Jiang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4698 - 4706
  • [47] Multiagent trajectory prediction with global-local scene-enhanced social interaction graph network
    Lin, Xuanqi
    Zhang, Yong
    Wang, Shun
    Piao, Xinglin
    Yin, Baocai
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)
  • [48] T2FPV: Dataset and Method for Correcting First-Person View Errors in Pedestrian Trajectory Prediction
    Stoler, Benjamin
    Jana, Meghdeep
    Hwang, Soonmin
    Oh, Jean
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 4037 - 4044
  • [49] AST-GNN: An attention-based spatio-temporal graph neural network for Interaction-aware pedestrian trajectory prediction
    Zhou, Hao
    Ren, Dongchun
    Xia, Huaxia
    Fan, Mingyu
    Yang, Xu
    Huang, Hai
    NEUROCOMPUTING, 2021, 445 : 298 - 308
  • [50] Social-IWSTCNN: A Social Interaction-Weighted Spatio-Temporal Convolutional Neural Network for Pedestrian Trajectory Prediction in Urban Traffic Scenarios
    Zhang, Chi
    Berger, Christian
    Dozza, Marco
    2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1515 - 1522