Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting

被引:12
|
作者
Choi, Dooseop [1 ]
Min, KyoungWook [1 ]
机构
[1] ETRI, Artificial Intelligence Res Lab, Daejeon, South Korea
来源
关键词
D O I
10.1007/978-3-031-20047-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoder (VAE) has widely been utilized for modeling data distributions because it is theoretically elegant, easy to train, and has nice manifold representations. However, when applied to image reconstruction and synthesis tasks, VAE shows the limitation that the generated sample tends to be blurry. We observe that a similar problem, in which the generated trajectory is located between adjacent lanes, often arises in VAE-based trajectory forecasting models. To mitigate this problem, we introduce a hierarchical latent structure into the VAE-based forecasting model. Based on the assumption that the trajectory distribution can be approximated as a mixture of simple distributions (or modes), the low-level latent variable is employed to model each mode of the mixture and the high-level latent variable is employed to represent the weights for the modes. To model each mode accurately, we condition the low-level latent variable using two lane-level context vectors computed in novel ways, one corresponds to vehicle-lane interaction and the other to vehicle-vehicle interaction. The context vectors are also used to model the weights via the proposed mode selection network. To evaluate our forecasting model, we use two large-scale real-world datasets. Experimental results show that our model is not only capable of generating clear multi-modal trajectory distributions but also outperforms the state-of-the-art (SOTA) models in terms of prediction accuracy. Our code is available at https://github.com/d1024choi/HLSTrajForecast.
引用
收藏
页码:129 / 145
页数:17
相关论文
共 50 条
  • [21] A FLEXIBLE TRAJECTORY COMPRESSION ALGORITHM FOR MULTI-MODAL TRANSPORTATION
    Mirvahabi, S. S.
    Abbaspour, R. Ali
    Claramunt, C.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 501 - 508
  • [22] Latent Space Model for Multi-Modal Social Data
    Cho, Yoon-Sik
    Ver Steeg, Greg
    Ferrara, Emilio
    Galstyan, Aram
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 447 - 458
  • [23] Multi-Modal Contextualization of Trajectory Data for Advanced Analysis
    Walther, Paul
    Deuser, Fabian
    Werner, Martin
    Datenbank-Spektrum, 2024, 24 (03) : 223 - 231
  • [24] Stability, structure and scale: improvements in multi-modal vessel extraction for SEEG trajectory planning
    Zuluaga, Maria A.
    Rodionov, Roman
    Nowell, Mark
    Achhala, Sufyan
    Zombori, Gergely
    Mendelson, Alex F.
    Cardoso, M. Jorge
    Miserocchi, Anna
    McEvoy, Andrew W.
    Duncan, John S.
    Ourselin, Sebastien
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2015, 10 (08) : 1227 - 1237
  • [25] Stability, structure and scale: improvements in multi-modal vessel extraction for SEEG trajectory planning
    Maria A. Zuluaga
    Roman Rodionov
    Mark Nowell
    Sufyan Achhala
    Gergely Zombori
    Alex F. Mendelson
    M. Jorge Cardoso
    Anna Miserocchi
    Andrew W. McEvoy
    John S. Duncan
    Sébastien Ourselin
    International Journal of Computer Assisted Radiology and Surgery, 2015, 10 : 1227 - 1237
  • [26] Bioinspired Hierarchical Ceramic Sutures for Multi-Modal Performance
    Katz, Zachary
    Sarvestani, Hamidreza Yazdani
    Gholipour, Javad
    Ashrafi, Behnam
    ADVANCED MATERIALS INTERFACES, 2023, 10 (14)
  • [27] Hierarchical multi-modal video summarization with dynamic sampling
    Yu, Lingjian
    Zhao, Xing
    Xie, Liang
    Liang, Haoran
    Liang, Ronghua
    IET IMAGE PROCESSING, 2024, 18 (14) : 4577 - 4588
  • [28] Landmark Classification With Hierarchical Multi-Modal Exemplar Feature
    Zhu, Lei
    Shen, Jialie
    Jin, Hai
    Xie, Liang
    Zheng, Ran
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (07) : 981 - 993
  • [29] Multi-perspective Multi-modal Trajectory Descriptions for Handwritten Strokes
    Parvez, Mohammad Tanvir
    Haque, Sardar Anisul
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 285 - 290
  • [30] An Integrity Assessment Framework for Multi-modal Vehicle Localization
    Balakrishnan, Arjun
    Rodriguez, Sergio A. F.
    Reynaud, Roger
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2976 - 2983