Multi-Modal Pedestrian Crossing Intention Prediction with Transformer-Based Model

被引:0
|
作者
Wang, Ting-Wei [1 ]
Lai, Shang-Hong [1 ]
机构
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
关键词
Pedestrian crossing intention prediction; multi-modal learning; transformer model; human posture;
D O I
10.1561/116.20240019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Pedestrian crossing intention prediction based on computer vision plays a pivotal role in enhancing the safety of autonomous driving and advanced driver assistance systems. In this paper, we present a novel multi-modal pedestrian crossing intention prediction framework leveraging the transformer model. By integrating diverse sources of information and leveraging the transformer's sequential modeling and parallelization capabilities, our system accurately predicts pedestrian crossing intentions. We introduce a novel representation of traffic environment data and incorporate lifted 3D human pose and head orientation data to enhance the model's understanding of pedestrian behavior. Experimental results demonstrate the state-of-the-art accuracy of our proposed system on benchmark datasets.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Pedestrian Crossing Intention Prediction with Multi-Modal Transformer-Based Model
    Wang, Ting Wei
    Lai, Shang-Hong
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1349 - 1356
  • [2] Anchor-based Multi-modal Transformer Network for Pedestrian Trajectory and Intention Prediction
    Lin, Yiwei
    Hu, Chuan
    Zhao, Baixuan
    Jiang, Hao
    Shan, Yonghang
    Ding, Taojun
    Zhang, Xi
    Proceedings of the 2023 7th CAA International Conference on Vehicular Control and Intelligence, CVCI 2023, 2023,
  • [3] Multi-modal Pedestrian Trajectory Prediction based on Pedestrian Intention for Intelligent Vehicle
    He, Youguo
    Sun, Yizhi
    Cai, Yingfeng
    Yuan, Chaochun
    Shen, Jie
    Tian, Liwei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (06): : 1562 - 1582
  • [4] Stochastic Non-Autoregressive Transformer-Based Multi-Modal Pedestrian Trajectory Prediction for Intelligent Vehicles
    Chen, Xiaobo
    Zhang, Huanjia
    Deng, Fuwen
    Liang, Jun
    Yang, Jian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3561 - 3574
  • [5] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
  • [6] Social Aware Multi-modal Pedestrian Crossing Behavior Prediction
    Zhai, Xiaolin
    Hu, Zhengxi
    Yang, Dingye
    Zhou, Lei
    Liu, Jingtai
    COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 275 - 290
  • [7] Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving
    Huang, Zhiyu
    Mo, Xiaoyu
    Lv, Chen
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2605 - 2611
  • [8] LLMT: A Transformer-Based Multi-Modal Lower Limb Human Motion Prediction Model for Assistive Robotics Applications
    Hossein Sadat Hosseini, S.
    Joojili, Nader N.
    Ahmadi, Mojtaba
    IEEE ACCESS, 2024, 12 : 82730 - 82741
  • [9] Movie tag prediction: An extreme multi-label multi-modal transformer-based solution with explanation
    Guarascio, Massimo
    Minici, Marco
    Pisani, Francesco Sergio
    De Francesco, Erika
    Lambardi, Pasquale
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (04) : 1021 - 1043
  • [10] Swin transformer-based GAN for multi-modal medical image translation
    Yan, Shouang
    Wang, Chengyan
    Chen, Weibo
    Lyu, Jun
    FRONTIERS IN ONCOLOGY, 2022, 12