Visual-Motion-Interaction-Guided Pedestrian Intention Prediction Framework

被引:6
|
作者
Sharma, Neha [1 ]
Dhiman, Chhavi [1 ]
Indu, S. [1 ]
机构
[1] Delhi Technol Univ DTU, Dept Elect & Commun & Engn, Delhi 110042, India
关键词
Autonomous vehicles (AVs); intention prediction; pedestrians;
D O I
10.1109/JSEN.2023.3317426
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The capability to comprehend the intention of pedestrians on the road is one of the most crucial skills that the current autonomous vehicles (AVs) are striving for, to become fully autonomous. In recent years, multi-modal methods have gained traction employing trajectory, appearance, and context for predicting pedestrian crossing intention. However, most existing research works still lag rich feature representational ability in a multimodal scenario, restricting their performance. Moreover, less emphasis is put on pedestrian interactions with the surroundings for predicting short-term pedestrian intention in a challenging ego-centric vision. To address these challenges, an efficient visual-motion-interaction-guided (VMI) intention prediction framework has been proposed. This framework comprises visual encoder (VE), motion encoder (ME), and interaction encoder (IE) to capture rich multimodal features of the pedestrian and its interactions with the surroundings, followed by temporal attention and adaptive fusion (AF) module (AFM) to integrate these multimodal features efficiently. The proposed framework outperforms several SOTA on benchmark datasets: Pedestrian Intention Estimation (PIE)/Joint Attention in Autonomous Driving (JAAD) with accuracy, AUC, F1-score, precision, and recall as 0.92/0.89, 0.91/0.90, 0.87/0.81, 0.86/0.79, and 0.88/0.83, respectively. Furthermore, extensive experiments are carried out to investigate different fusion architectures and design parameters of all encoders. The proposed VMI framework predicts pedestrian crossing intention 2.5 s ahead of the crossing event. Code is available at: https://github.com/neha013/VMI.git.
引用
收藏
页码:27540 / 27548
页数:9
相关论文
共 50 条
  • [41] Pedestrian Crossing Intention Prediction Model Considering Social Interaction between Multi-Pedestrians and Multi-Vehicles
    Zhou, Zhuping
    Liu, Yang
    Liu, Bowen
    Ouyang, Molan
    Tang, Ruiyao
    TRANSPORTATION RESEARCH RECORD, 2024, 2678 (05) : 80 - 101
  • [42] Learning Time Series Models for Pedestrian Motion Prediction
    Zhou, Chenghui
    Balle, Borja
    Pineau, Joelle
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 3323 - 3330
  • [43] WatchPed: Pedestrian Crossing Intention Prediction Using Embedded Sensors of Smartwatch
    Abbasi, Jibran Ali
    Imran, Navid Mohammad
    Das, Lokesh Chandra
    Won, Myounggyu
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9574 - 9581
  • [44] Probabilistic Prediction of Pedestrian Crossing Intention Using Roadside LiDAR Data
    Zhao, Junxuan
    Li, Yinfeng
    Xu, Hao
    Liu, Hongchao
    IEEE ACCESS, 2019, 7 : 93781 - 93790
  • [45] Pedestrian Crossing Intention Prediction Method Based on Multimodal Feature Fusion
    Chen, Long
    Yang, Chen
    Cai, Yingfeng
    Wang, Hai
    Li, Yicheng
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (10): : 1779 - 1790
  • [46] Learn from IoT: Pedestrian Detection and Intention Prediction for Autonomous Driving
    Solmaz, Gurkan
    Berz, Everton Luis
    Dolatabadi, Marzieh Farahani
    Aytac, Samet
    Furst, Jonathan
    Cheng, Bin
    den Ouden, Jos
    PROCEEDINGS OF THE 1ST ACM WORKSHOP ON EMERGING SMART TECHNOLOGIES AND INFRASTRUCTURES FOR SMART MOBILITY AND SUSTAINABILITY (SMAS '19), 2019, : 27 - 32
  • [47] Pedestrian Intention and Pose Prediction through Dynamical Models and Behaviour Classification
    Quintero, R.
    Parra, I.
    Llorca, D. F.
    Sotelo, M. A.
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, : 83 - 88
  • [48] Prediction of Pedestrian Intention and Trajectory Based on Multi-feature Fusion
    Cao H.-T.
    Shi H.-J.
    Song X.-L.
    Li M.-J.
    Dai H.-L.
    Huang Z.
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2022, 35 (10): : 308 - 318
  • [49] TrEP: Transformer-Based Evidential Prediction for Pedestrian Intention with Uncertainty
    Zhang, Zhengming
    Tian, Renran
    Ding, Zhengming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3534 - 3542
  • [50] MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction
    Ham, Je-Seok
    Bae, Kangmin
    Moon, Jinyoung
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13801 LNCS : 663 - 679