Fully Convolutional Encoder-Decoder With an Attention Mechanism for Practical Pedestrian Trajectory Prediction

Cited by: 10
Authors
Chen, Kai [1]
Song, Xiao [2]
Yuan, Haitao [3]
Ren, Xiaoxiang [4]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Mech & Elect Engn, Nanjing 210016, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] Wendong New Dist Middle Sch, Lvliang 032100, Shanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Trajectory; Predictive models; Feature extraction; Convolutional neural networks; Markov processes; Force; Convolution; Pedestrian behavior; convolution; long short-term memory (LSTM); attention mechanism;
DOI
10.1109/TITS.2022.3170874
Chinese Library Classification (CLC) number
TU [Building Science];
Discipline classification code
0813;
Abstract
Pedestrian trajectory prediction from video is essential for many practical traffic applications. Most existing pedestrian trajectory prediction methods are based on fully connected long short-term memory (LSTM) networks and perform well on public datasets. However, these methods still have three defects: a) most of them rely on manual annotations to obtain information about the environment surrounding the subject pedestrian, which limits practical applications; b) the interaction among pedestrians and obstacles in a scene has been little studied, which leads to greater prediction error; c) traditional LSTM methods are based on the previous moment and ignore the correlation between the future and distant past states of the pedestrian, which generates unrealistic trajectories. To tackle these problems, first, in the data processing stage, we use an image semantic segmentation algorithm to obtain multi-category obstacle information and design an end-to-end "Siamese Position Extraction" model to obtain more accurate pedestrian interaction data. Second, we design an end-to-end fully convolutional LSTM encoder-decoder with an attention mechanism (FLEAM) to overcome the shortcomings of LSTM. Third, we compare FLEAM with several state-of-the-art LSTM-based prediction methods on multiple video sequences in the ETH, UCY and MOT20 datasets. The results show that our approach matches the lowest prediction error achieved by the state-of-the-art methods. However, FLEAM has more potential for practical application because it does not rely on manually annotated data. We further validate the effectiveness of FLEAM by employing manually annotated data, finding that it yields a much lower prediction error.
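The abstract's point c) contrasts a plain LSTM, which conditions on the previous moment only, with an attention mechanism that can draw on distant past states. The sketch below illustrates that general idea with generic dot-product attention in plain Python. It is not the FLEAM implementation: the abstract does not specify the scoring function, and the names `attend`, `past_states` and `query`, as well as the toy 2-D state values, are assumptions made purely for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, past_states):
    """Generic dot-product attention (an assumed scoring function).

    Scores every past state against the query, normalises the scores
    into weights, and returns the weights with the weighted-sum context.
    """
    scores = [sum(q * h for q, h in zip(query, state)) for state in past_states]
    weights = softmax(scores)
    dim = len(query)
    context = [
        sum(w * state[i] for w, state in zip(weights, past_states))
        for i in range(dim)
    ]
    return weights, context


# Hypothetical 2-D encoder states for four observed time steps.
past_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
# Hypothetical decoder query for the step being predicted.
query = [1.0, 0.0]

weights, context = attend(query, past_states)
```

Here every observed step contributes to `context`, weighted by its similarity to the query, so an early state can influence the prediction as much as the most recent one.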
Pages: 20046-20060
Page count: 15
Related papers
50 in total
  • [21] Detection of black box signal based on encoder-decoder fully convolutional networks
    Ji, Huazhong
    Zhou, Jie
    Pan, Xiang
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [22] Microseismic Signal Denoising and Separation Based on Fully Convolutional Encoder-Decoder Network
    Zhang, Hang
    Ma, Chunchi
    Pazzi, Veronica
    Zou, Yulin
    Casagli, Nicola
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [23] Optimizing the Hyperparameters of Fully Convolutional Encoder-Decoder Networks for SAR Image Segmentation
    Liu, Yuanyue
    Zhao, Jin
    Fan, Jianchao
    Wang, Jun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [24] Gated Convolutional Encoder-Decoder for Semi-supervised Affect Prediction
    Chawla, Kushal
    Khosla, Sopan
    Chhaya, Niyati
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 237 - 250
  • [25] Product Quality Prediction with Convolutional Encoder-Decoder Architecture and Transfer Learning
    Chih, Hao-Yi
    Fan, Yao-Chung
    Peng, Wen-Chih
    Kuo, Hai-Yuan
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 195 - 204
  • [26] An encoder-decoder model with embedded attention-mechanism for efficient meshfree prediction of slope failure
    Chen, Jun
    Wang, Dongdong
    Deng, Like
    Ying, Jijun
    INTERNATIONAL JOURNAL OF DAMAGE MECHANICS, 2023, 32 (10) : 1164 - 1187
  • [27] Aircraft Trajectory Prediction With Enriched Intent Using Encoder-Decoder Architecture
    Tran, Phu N.
    Nguyen, Hoang Q., V
    Pham, Duc-Thinh
    Alam, Sameer
    IEEE ACCESS, 2022, 10 : 17881 - 17896
  • [28] Convolutional Encoder-Decoder Networks for Robust Image-to-Motion Prediction
    Ridge, Barry
    Pahic, Rok
    Ude, Ales
    Morimoto, Jun
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 514 - 523
  • [29] A novel approach for protein secondary structure prediction using encoder-decoder with attention mechanism model
    Sonsare, Pravinkumar M.
    Gunavathi, Chellamuthu
    BIOMOLECULAR CONCEPTS, 2024, 15 (01)
  • [30] Eyenet: Attention based Convolutional Encoder-Decoder Network for Eye Region Segmentation
    Kansal, Priya
    Nathan, Sabari
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3688 - 3693