Multi-pedestrian trajectory prediction method based on multi-view 3D simulation video learning

Cited by: 0
Authors
Cao, Xingwen [1,2]
Zheng, Hongwei [1,2]
Liu, Ying [1,2]
Wu, Mengquan [3]
Wang, Lingyue [1,2]
Bao, Anming [1,2]
Chen, Xi [1,2]
Affiliations
[1] State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, 830011, China
[2] College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, 100049, China
[3] School of Resources and Environmental Engineering, Ludong University, Yantai, 264025, China
Funding
National Natural Science Foundation of China
Keywords
Decoding - Forecasting - Geographic information systems - Information systems - Information use - Learning systems - Recurrent neural networks - Urban transportation
DOI
10.11947/j.AGCS.2023.20220239
Abstract
Multi-pedestrian trajectory prediction is a key component in integrating urban geographic information systems with intelligent transportation. Existing methods suffer from insufficient training data, difficult labeling, and low prediction accuracy in multi-view scenes. To address these problems, we propose a novel multi-pedestrian trajectory prediction method based on multi-view 3D simulation video learning. First, a simulator is used to generate the required multi-view pedestrian trajectory annotation data. Then, the trajectory of the selected view is mixed with the adversarial trajectory through a convex combination function to generate an enhanced adversarial trajectory. Next, an advanced detection and tracking algorithm is used to encode and track pedestrian appearance information. The enhanced trajectory and the encoded information are then used as the feature input of a graph attention recurrent neural network that models pedestrian interaction. Finally, a position decoder decodes the pedestrian trajectory to extract pedestrian motion characteristics and complete the multi-pedestrian trajectory prediction. On the ETH/UCY fixed-view dataset, our method achieves an average displacement error (ADE) of 0.41 and a final displacement error (FDE) of 0.82. On the ActEV/VIRAT and Argoverse multi-view datasets, the ADE is 17.74 and 65.4, and the FDE is 34.96 and 172.8, respectively. © 2023 SinoMaps Press. All rights reserved.
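Two ideas in the abstract are easy to illustrate: the convex combination used to mix a selected-view trajectory with an adversarial one, and the ADE/FDE metrics used to report results. The sketch below is only an illustration under stated assumptions, not the authors' implementation. The function names `mixup_trajectory` and `ade_fde`, the mixing weight `lam`, and the `(T, 2)` array layout are hypothetical; the abstract does not say how the weight is chosen or how the adversarial trajectory is produced.

```python
import numpy as np


def mixup_trajectory(selected, adversarial, lam):
    """Convex combination of two trajectories of shape (T, 2).

    `lam` is an assumed mixing weight in [0, 1]; the abstract does not
    specify how it is chosen.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1] for a convex combination")
    selected = np.asarray(selected, dtype=float)
    adversarial = np.asarray(adversarial, dtype=float)
    if selected.shape != adversarial.shape:
        raise ValueError("trajectories must have the same shape")
    return lam * selected + (1.0 - lam) * adversarial


def ade_fde(pred, truth):
    """Average and final displacement error for one trajectory.

    ADE is the mean Euclidean distance over all predicted time steps;
    FDE is the Euclidean distance at the last time step.
    """
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)
    dists = np.linalg.norm(pred - truth, axis=-1)  # one distance per step
    return float(dists.mean()), float(dists[-1])


# Toy data: two parallel straight-line trajectories, three time steps each.
selected = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
adversarial = [[0.0, 2.0], [1.0, 2.0], [2.0, 2.0]]
mixed = mixup_trajectory(selected, adversarial, lam=0.5)

ade, fde = ade_fde(pred=[[0.0, 0.0], [3.0, 4.0]], truth=[[0.0, 0.0], [0.0, 0.0]])
```

With `lam=0.5` the mixed trajectory lies midway between the two inputs. Both metrics are errors, so lower values indicate better predictions.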
Pages: 1595-1608