Long-Term Human Trajectory Prediction Using 3D Dynamic Scene Graphs

被引:0
|
作者
Gorlo, Nicolas [1 ]
Schmid, Lukas [1 ]
Carlone, Luca [1 ]
机构
[1] MIT, MIT SPARK Lab, Cambridge, MA 02139 USA
来源
基金
瑞士国家科学基金会; 芬兰科学院;
关键词
Trajectory; Probabilistic logic; Three-dimensional displays; Predictive models; Indoor environment; Planning; Cognition; Annotations; Service robots; Legged locomotion; AI-enabled robotics; human-centered robotics; service robotics; datasets for human motion; modeling and simulating humans; NAVIGATION;
D O I
10.1109/LRA.2024.3482169
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present a novel approach for long-term human trajectory prediction in indoor human-centric environments, which is essential for long-horizon robot planning in these environments. State-of-the-art human trajectory prediction methods are limited by their focus on collision avoidance and short-term planning, and their inability to model complex interactions of humans with the environment. In contrast, our approach overcomes these limitations by predicting sequences of human interactions with the environment and using this information to guide trajectory predictions over a horizon of up to 60s . We leverage Large Language Models (LLMs) to predict interactions with the environment by conditioning the LLM prediction on rich contextual information about the scene. This information is given as a 3D Dynamic Scene Graph that encodes the geometry, semantics, and traversability of the environment into a hierarchical representation. We then ground these interaction sequences into multi-modal spatio-temporal distributions over human positions using a probabilistic approach based on continuous-time Markov Chains. To evaluate our approach, we introduce a new semi-synthetic dataset of long-term human trajectories in complex indoor environments, which also includes annotations of human-object interactions. We show in thorough experimental evaluations that our approach achieves a 54% lower average negative log-likelihood and a 26.5% lower Best-of-20 displacement error compared to the best non-privileged (i.e., evaluated in a zero-shot fashion on the dataset) baselines for a time horizon of 60 s .
引用
收藏
页码:10978 / 10985
页数:8
相关论文
共 50 条
  • [1] 3D VSG: Long-term Semantic Scene Change Prediction through 3D Variable Scene Graphs
    Looper, Samuel
    Rodriguez-Puigvert, Javier
    Siegwart, Roland
    Cadena, Cesar
    Schmid, Lukas
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8179 - 8186
  • [2] Dynamic Open-Vocabulary 3D Scene Graphs for Long-Term Language-Guided Mobile Manipulation
    Yan, Zhijie
    Li, Shufei
    Wang, Zuoxu
    Wu, Lixiu
    Wang, Han
    Zhu, Jun
    Chen, Lijiang
    Liu, Jihong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4252 - 4259
  • [3] 3D action recognition and long-term prediction of human motion
    Hahn, Markus
    Krueger, Lars
    Woehler, Christian
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 23 - 32
  • [4] Human Centered Scene Understanding Based on 3D Long-Term Tracking Data
    Planinc, Rainer
    Kampel, Martin
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 93 - 100
  • [5] Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs
    Feng, Shengyu
    Mostafa, Hesham
    Nassar, Marcel
    Majumdar, Somdeb
    Tripathi, Subarna
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5119 - 5128
  • [6] Long-term 3D map maintenance in dynamic environments
    Pomerleau, Francois
    Krusi, Philipp
    Colas, Francis
    Furgale, Paul
    Siegwart, Roland
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 3712 - 3719
  • [7] Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes
    Wang, Jiashun
    Xu, Huazhe
    Xu, Jingwei
    Liu, Sifei
    Wang, Xiaolong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9396 - 9406
  • [8] SGAligner: 3D Scene Alignment with Scene Graphs
    Sarkar, Sayan Deb
    Miksik, Ondrej
    Pollefeys, Marc
    Barath, Daniel
    Armeni, Iro
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21870 - 21880
  • [9] Long-term 4D trajectory prediction using generative adversarial networks
    Wu, Xiping
    Yang, Hongyu
    Chen, Hu
    Hu, Qinzhi
    Hu, Haoliang
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 136
  • [10] Human trajectory prediction in crowded scene using social-affinity Long Short-Term Memory
    Pei, Zhao
    Qi, Xiaoning
    Zhang, Yanning
    Ma, Miao
    Yang, Yee-Hong
    PATTERN RECOGNITION, 2019, 93 : 273 - 282