Multimodal based attention-pyramid for predicting pedestrian trajectory

被引:1
|
作者
Yan, Xue [1 ]
Yang, Jinfu [1 ,2 ]
Liu, Yubin [1 ]
Song, Lin [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
trajectory prediction; attention mechanism; recurrent neural network; multimodal fusion;
D O I
10.1117/1.JEI.31.5.053008
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The goal of pedestrian trajectory prediction is to predict the future trajectory according to the historical one of pedestrians. Multimodal information in the historical trajectory is conducive to perception and positioning, especially visual information and position coordinates. However, most of the current algorithms ignore the significance of multimodal information in the historical trajectory. We describe pedestrian trajectory prediction as a multimodal problem, in which historical trajectory is divided into an image and coordinate information. Specifically, we apply fully connected long short-term memory (FC-LSTM) and convolutional LSTM (ConvLSTM) to receive and process location coordinates and visual information respectively, and then fuse the information by a multimodal fusion module. Then, the attention pyramid social interaction module is built based on information fusion, to reason complex spatial and social relations between target and neighbors adaptively. The proposed approach is validated on different experimental verification tasks on which it can get better performance in terms of accuracy than other counterparts. (c) 2022 SPIE and IS&T
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Temporal Attention-Pyramid Pooling for Temporal Action Detection
    Gan, Ming-Gang
    Zhang, Yan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3799 - 3810
  • [2] SGAMTE-Net: A pedestrian trajectory prediction network based on spatiotemporal graph attention and multimodal trajectory endpoints
    Xin, Yang
    Liao, Bingxian
    Wang, Xiangcheng
    APPLIED INTELLIGENCE, 2023, 53 (24) : 31165 - 31180
  • [3] SGAMTE-Net: A pedestrian trajectory prediction network based on spatiotemporal graph attention and multimodal trajectory endpoints
    Xin Yang
    Liao Bingxian
    Wang Xiangcheng
    Applied Intelligence, 2023, 53 : 31165 - 31180
  • [4] Temporal Pyramid Network With Spatial-Temporal Attention for Pedestrian Trajectory Prediction
    Li, Yuanman
    Liang, Rongqin
    Wei, Wei
    Wang, Wei
    Zhou, Jiantao
    Li, Xia
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (03): : 1006 - 1019
  • [5] Pedestrian Trajectory Prediction Using a Social Pyramid
    Xue, Hao
    Huynh, Du Q.
    Reynolds, Mark
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 439 - 453
  • [6] Pedestrian Trajectory Prediction Based on GAN and Attention Mechanism
    Ouyang Jun
    Shi Qingwei
    Wang Xinxin
    Wang Liang
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (14)
  • [7] Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction Using a Graph Vehicle-Pedestrian Attention Network
    Eiffert, Stuart
    Li, Kunming
    Shan, Mao
    Worrall, Stewart
    Sukkarieh, Salah
    Nebot, Eduardo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 5026 - 5033
  • [8] Multimodal Transformer Network for Pedestrian Trajectory Prediction
    Yin, Ziyi
    Liu, Ruijin
    Xiong, Zhiliang
    Yuan, Zejian
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1259 - 1265
  • [9] Vehicle Pedestrian Detection Method Based on Spatial Pyramid Pooling and Attention Mechanism
    Guo, Mingtao
    Xue, Donghui
    Li, Peng
    Xu, He
    INFORMATION, 2020, 11 (12) : 1 - 15
  • [10] Pedestrian trajectory prediction based on spatio-temporal attention mechanism
    Hu, Jun
    Yang, Xinyu
    Yan, Liang
    Zhang, Qinghua
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (08) : 3299 - 3312