Fully Convolutional Encoder-Decoder With an Attention Mechanism for Practical Pedestrian Trajectory Prediction

被引:10
|
作者
Chen, Kai [1 ]
Song, Xiao [2 ]
Yuan, Haitao [3 ]
Ren, Xiaoxiang [4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Mech & Elect Engn, Nanjing 210016, Peoples R China
[2] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[3] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] Wendong New Dist Middle Sch, Lvliang 032100, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Trajectory; Predictive models; Feature extraction; Convolutional neural networks; Markov processes; Force; Convolution; Pedestrian behavior; convolution; long short-term memory (LSTM); attention mechanism;
D O I
10.1109/TITS.2022.3170874
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Pedestrian trajectory prediction using video is essential for many practical traffic applications. Most existing pedestrian trajectory prediction methods are based on fully connected long short-term memory (LSTM) networks and perform well on public datasets. However, these methods still have three defects: a) Most of them rely on manual annotations to obtain information about the environment surrounding the subject pedestrian, which limits practical applications; b) The interaction among pedestrians and obstacles in a scene is little studied, which leads to greater prediction error; c) Traditional LSTM methods are based on the previous moment and ignore the correlation between the future and distant past states of the pedestrian, which generates unrealistic trajectories. To tackle these problems, first, in the stage of data processing, we use an image semantic segmentation algorithm to obtain multi-category obstacle information and design an end-to-end ``Siamese Position Extraction'' model to obtain more accurate pedestrian interaction data. Second, we design an end-to-end fully convolutional LSTM encoder-decoder with an attention mechanism (FLEAM) to overcome the shortcomings of LSTM. Third, we compare FLEAM with several state-of-the-art LSTM-based prediction methods on multiple video sequences in the datasets ETH, UCY and MOT20. The results show that our approach generates the same prediction error as the best results of the state-of-the-art method. However, FLEAM has more potential for practice application because it does not rely on manually annotated data. We further validate the effectiveness of FLEAM by employing manually annotated data, finding that it generates much less prediction error.
引用
收藏
页码:20046 / 20060
页数:15
相关论文
共 50 条
  • [11] Hybrid Attention-Based Encoder-Decoder Fully Convolutional Network for PolSAR Image Classification
    Fang, Zheng
    Zhang, Gong
    Dai, Qijun
    Xue, Biao
    Wang, Peng
    REMOTE SENSING, 2023, 15 (02)
  • [12] Denoising Raman spectra using fully convolutional encoder-decoder network
    Loc, Irem
    Kecoglu, Ibrahim
    Unlu, Mehmet Burcin
    Parlatan, Ugur
    JOURNAL OF RAMAN SPECTROSCOPY, 2022, 53 (08) : 1445 - 1452
  • [13] Fully Convolutional Encoder-Decoder Architecture (FCEDA) for Skin Lesions Segmentation
    Adegun, Adekanmi
    Viriri, Serestina
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, 2019, 11683 : 426 - 437
  • [14] A Convolutional Encoder-Decoder Network With Skip Connections for Saliency Prediction
    Qi, Fei
    Lin, Chunhuan
    Shi, Guangming
    Li, Hao
    IEEE ACCESS, 2019, 7 : 60428 - 60438
  • [15] Pedestrian Detection at Night in Infrared Images Using an Attention-Guided Encoder-Decoder Convolutional Neural Network
    Chen, Yunfan
    Shin, Hyunchul
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [16] Aircraft Bleed Air System Fault Prediction based on Encoder-Decoder with Attention Mechanism
    Su, Siyu
    Sun, Youchao
    Peng, Chong
    Wang, Yifan
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2023, 25 (03):
  • [17] Embedding Encoder-Decoder With Attention Mechanism for Monaural Speech Enhancement
    Lan, Tian
    Ye, Wenzheng
    Lyu, Yilan
    Zhang, Junyi
    Liu, Qiao
    IEEE ACCESS, 2020, 8 : 96677 - 96685
  • [18] Enhancing lane changing trajectory prediction on highways: A heuristic attention-based encoder-decoder model
    Xiao, Xue
    Bo, Peng
    Chen, Yingda
    Chen, Yili
    Li, Keping
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 639
  • [19] Recurrent Encoder-Decoder Networks for Vessel Trajectory Prediction With Uncertainty Estimation
    Capobianco, Samuele
    Forti, Nicola
    Millefiori, Leonardo Maria
    Braca, Paolo
    Willett, Peter
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (03) : 2554 - 2565
  • [20] Optimizing Fully Convolutional Encoder-Decoder Network for Segmentation of Diabetic Eye Disease
    Khan, Abdul Qadir
    Sun, Guangmin
    Li, Yu
    Bilal, Anas
    Manan, Malik Abdul
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 2481 - 2504