Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting

被引:12
|
作者
Choi, Dooseop [1 ]
Min, KyoungWook [1 ]
机构
[1] ETRI, Artificial Intelligence Res Lab, Daejeon, South Korea
来源
关键词
D O I
10.1007/978-3-031-20047-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoder (VAE) has widely been utilized for modeling data distributions because it is theoretically elegant, easy to train, and has nice manifold representations. However, when applied to image reconstruction and synthesis tasks, VAE shows the limitation that the generated sample tends to be blurry. We observe that a similar problem, in which the generated trajectory is located between adjacent lanes, often arises in VAE-based trajectory forecasting models. To mitigate this problem, we introduce a hierarchical latent structure into the VAE-based forecasting model. Based on the assumption that the trajectory distribution can be approximated as a mixture of simple distributions (or modes), the low-level latent variable is employed to model each mode of the mixture and the high-level latent variable is employed to represent the weights for the modes. To model each mode accurately, we condition the low-level latent variable using two lane-level context vectors computed in novel ways, one corresponds to vehicle-lane interaction and the other to vehicle-vehicle interaction. The context vectors are also used to model the weights via the proposed mode selection network. To evaluate our forecasting model, we use two large-scale real-world datasets. Experimental results show that our model is not only capable of generating clear multi-modal trajectory distributions but also outperforms the state-of-the-art (SOTA) models in terms of prediction accuracy. Our code is available at https://github.com/d1024choi/HLSTrajForecast.
引用
收藏
页码:129 / 145
页数:17
相关论文
共 50 条
  • [31] Hierarchical system architecture for multi-agent multi-modal systems
    Koo, TJ
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 1509 - 1514
  • [32] Exploring Complex Dependencies for Multi-modal Semantic Trajectory Prediction
    Liu, Jie
    Zhang, Lei
    Zhu, Shaojie
    Liu, Bailong
    Liang, Zhizheng
    Yang, Susong
    NEURAL PROCESSING LETTERS, 2022, 54 (02) : 961 - 985
  • [33] Kernel Trajectory Maps for Multi-Modal Probabilistic Motion Prediction
    Zhi, Weiming
    Ott, Lionel
    Ramos, Fabio
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [34] Unsupervised Trajectory Segmentation and Promoting of Multi-Modal Surgical Demonstrations
    Shao, Zhenzhou
    Zhao, Hongfa
    Xie, Jiexin
    Qu, Ying
    Guan, Yong
    Tan, Jindong
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 777 - 782
  • [35] Identification of parking spaces from multi-modal trajectory data
    Dey, Subhrasankha
    Winter, Stephan
    Goel, Salil
    Tomko, Martin
    TRANSACTIONS IN GIS, 2021, 25 (06) : 3088 - 3118
  • [36] Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction
    Bae, Inhwan
    Park, Jin-Hwi
    Jeon, Hae-Gon
    COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 270 - 289
  • [37] Latent correlation embedded discriminative multi-modal data fusion
    Zhu, Qi
    Xu, Xiangyu
    Yuan, Ning
    Zhang, Zheng
    Guan, Donghai
    Huang, Sheng-Jun
    Zhang, Daoqiang
    SIGNAL PROCESSING, 2020, 171
  • [38] Exploring Complex Dependencies for Multi-modal Semantic Trajectory Prediction
    Jie Liu
    Lei Zhang
    Shaojie Zhu
    Bailong Liu
    Zhizheng Liang
    Susong Yang
    Neural Processing Letters, 2022, 54 : 961 - 985
  • [39] Hmltnet: multi-modal fake news detection via hierarchical multi-grained features fused with global latent topic
    Shaoguo Cui
    Linfeng Gong
    Tiansong Li
    Neural Computing and Applications, 2025, 37 (7) : 5559 - 5575
  • [40] A Multi-vehicle Testbed for Multi-modal, Decentralized Sensing of the Environment
    Cortez, R. Andres
    Luna, Jose-Marcio
    Fierro, Rafael
    Wood, John
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 1088 - +