Pedestrian Crossing Intention Prediction with Multi-Modal Transformer-Based Model

被引:0
|
作者
Wang, Ting Wei [1 ]
Lai, Shang-Hong [1 ,2 ]
机构
[1] Natl Tsing Hua Univ, Inst Informat Syst & Applicat, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
D O I
10.1109/APSIPAASC58517.2023.10317161
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The popularity of autonomous driving and advanced driver assistance systems can potentially reduce thousands of car accidents and casualties. In particular, pedestrian prediction and protection is an urgent development priority for such systems. Prediction of pedestrians' intentions of crossing the road or their actions can help such systems to assess the risk of pedestrians in front of vehicles in advance. In this paper, we propose a multi-modal pedestrian crossing intention prediction framework based on the transformer model to provide a better solution. Our method takes advantage of the excellent sequential modeling capability of the Transformer, enabling the model to perform stably in this task. We also propose to represent traffic environment information in a novel way, allowing such information can be efficiently exploited. Moreover, we include the lifted 3D human pose and 3D head orientation information estimated from pedestrian image into the model prediction, allowing the model to understand pedestrian posture better. Finally, our experimental results show the proposed system provides state-of-the-art accuracy on benchmarking datasets.
引用
收藏
页码:1349 / 1356
页数:8
相关论文
共 50 条
  • [1] Multi-Modal Pedestrian Crossing Intention Prediction with Transformer-Based Model
    Wang, Ting-Wei
    Lai, Shang-Hong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (05)
  • [2] Anchor-based Multi-modal Transformer Network for Pedestrian Trajectory and Intention Prediction
    Lin, Yiwei
    Hu, Chuan
    Zhao, Baixuan
    Jiang, Hao
    Shan, Yonghang
    Ding, Taojun
    Zhang, Xi
    Proceedings of the 2023 7th CAA International Conference on Vehicular Control and Intelligence, CVCI 2023, 2023,
  • [3] Multi-modal Pedestrian Trajectory Prediction based on Pedestrian Intention for Intelligent Vehicle
    He, Youguo
    Sun, Yizhi
    Cai, Yingfeng
    Yuan, Chaochun
    Shen, Jie
    Tian, Liwei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (06): : 1562 - 1582
  • [4] Stochastic Non-Autoregressive Transformer-Based Multi-Modal Pedestrian Trajectory Prediction for Intelligent Vehicles
    Chen, Xiaobo
    Zhang, Huanjia
    Deng, Fuwen
    Liang, Jun
    Yang, Jian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3561 - 3574
  • [5] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
  • [6] Social Aware Multi-modal Pedestrian Crossing Behavior Prediction
    Zhai, Xiaolin
    Hu, Zhengxi
    Yang, Dingye
    Zhou, Lei
    Liu, Jingtai
    COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 275 - 290
  • [7] Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving
    Huang, Zhiyu
    Mo, Xiaoyu
    Lv, Chen
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2605 - 2611
  • [8] TrEP: Transformer-Based Evidential Prediction for Pedestrian Intention with Uncertainty
    Zhang, Zhengming
    Tian, Renran
    Ding, Zhengming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3534 - 3542
  • [9] LLMT: A Transformer-Based Multi-Modal Lower Limb Human Motion Prediction Model for Assistive Robotics Applications
    Hossein Sadat Hosseini, S.
    Joojili, Nader N.
    Ahmadi, Mojtaba
    IEEE ACCESS, 2024, 12 : 82730 - 82741
  • [10] Movie tag prediction: An extreme multi-label multi-modal transformer-based solution with explanation
    Guarascio, Massimo
    Minici, Marco
    Pisani, Francesco Sergio
    De Francesco, Erika
    Lambardi, Pasquale
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (04) : 1021 - 1043