Learning From Interaction-Enhanced Scene Graph for Pedestrian Collision Risk Assessment

Cited by: 7
Authors
Liu, Xinxin [1]
Zhou, Yuchen [1]
Gou, Chao [1]
Affiliations
[1] Sun Yat-sen University, School of Intelligent Systems Engineering, Shenzhen 518107, People's Republic of China
Source
Funding
National Natural Science Foundation of China
Keywords
Traffic scene graphs; collision risk assessment; autonomous driving systems; benchmark dataset; prediction; behavior
DOI
10.1109/TIV.2023.3309274
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Collision risk assessment aims to provide a subjective cognitive comprehension of the risk level in driving scenarios, which is critical for the safety of autonomous driving systems. Pedestrian crossing scenarios contain intricate human-vehicle interactions, so it is important to capture the rich relations between traffic entities and to assess the collision risk promptly. Existing studies focus on modeling the spatial relationships between the ego-vehicle and other vehicles in typical traffic scenarios, while ignoring the complex interactions between pedestrians and the ego-vehicle in critical driving scenarios. To address this issue, we propose constructing traffic scene graphs with enhanced vehicle-pedestrian interactions, together with a deep model built upon Transformer and GCN for pedestrian collision risk assessment. Specifically, to facilitate spatio-temporal modeling of traffic scene graph sequences, we propose a unified framework that integrates a Multi-Relation Graph Convolution Network (MR-GCN) and a Temporal Transformer Encoder. In addition, two variants of traffic scene graph datasets, termed Interaction-Enhanced Scene Graph (IESG) and None-Interaction-Enhanced Scene Graph (Non-IESG), are created for assessing pedestrian collision risk, utilizing CAP-DATA and JAAD respectively. Experiments are conducted on these newly created traffic scene graph datasets of pedestrian crossing scenes. The results on the IESG dataset show that our model outperforms the baseline model with higher accuracy (94% vs. 84%), higher AUC (98% vs. 89%), and higher F1-score (93% vs. 84%). The IESG and Non-IESG datasets are available at https://github.com/Pedestrian-Crossing-Collision-Risk-Assessment-Datasets.
Pages: 4237 - 4248
Page count: 12
Related papers
50 in total
  • [1] Dynamic Attention-Enhanced Spatio-Temporal Network for Pedestrian Collision Risk Assessment
    Gao, Hui
    Wang, Benfei
    Liu, Xinxin
    Zhou, Yuchen
    Gou, Chao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 207 - 221
  • [2] Edge Feature-Enhanced Network for Collision Risk Assessment Using Traffic Scene Graphs
    Liu, Xinxin
    Zhou, Yuchen
    Ye, Yongqi
    Gou, Chao
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024
  • [3] Learning Scene-Pedestrian Graph for End-to-End Person Search
    Song, Zifan
    Zhao, Cairong
    Hu, Guosheng
    Miao, Duoqian
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2979 - 2990
  • [4] IeMTLF: Interaction-enhanced Multi-Task Learning Framework for next location prediction
    Wang, Yahui
    Chen, Hongchang
    Liu, Shuxin
    Wang, Kai
    Li, Xing
    Hu, Yuxiang
    INFORMATION SCIENCES, 2024, 661
  • [5] Probabilistic risk assessment for pedestrian-vehicle collision considering uncertainties of pedestrian mobility
    Huang, Zhi
    Liu, Xiangyi
    Song, Xiaolin
    He, Yin
    TRAFFIC INJURY PREVENTION, 2017, 18 (06) : 650 - 656
  • [6] Modal Interaction-Enhanced Prompt Learning by Transformer Decoder for Vision-Language Models
    Liu, Mingyue
    Zhao, Honggang
    Ma, Longfei
    Li, Xiang
    Ji, Yucheng
    Li, Mingyong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 163 - 174
  • [7] Modal interaction-enhanced prompt learning by transformer decoder for vision-language models
    Liu, Mingyue
    Zhao, Honggang
    Ma, Longfei
    Li, Mingyong
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [8] Modal interaction-enhanced prompt learning by transformer decoder for vision-language models
    Mingyue Liu
    Honggang Zhao
    Longfei Ma
    Mingyong Li
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [9] Multimodal Event Causality Reasoning with Scene Graph Enhanced Interaction Network
    Liu, Jintao
    Wei, Kaiwen
    Liu, Chenglong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024 : 8778 - 8786
  • [10] Pedestrian Collision Risk Assessment Based on State Estimation and Motion Prediction
    Zhang, Lin
    Yuan, Kang
    Chu, Hongqing
    Huang, Yanjun
    Ding, Haitao
    Yuan, Jiawei
    Chen, Hong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 98 - 111