Long-Term Human Trajectory Prediction Using 3D Dynamic Scene Graphs

被引:0
|
作者
Gorlo, Nicolas [1 ]
Schmid, Lukas [1 ]
Carlone, Luca [1 ]
机构
[1] MIT, MIT SPARK Lab, Cambridge, MA 02139 USA
来源
基金
瑞士国家科学基金会; 芬兰科学院;
关键词
Trajectory; Probabilistic logic; Three-dimensional displays; Predictive models; Indoor environment; Planning; Cognition; Annotations; Service robots; Legged locomotion; AI-enabled robotics; human-centered robotics; service robotics; datasets for human motion; modeling and simulating humans; NAVIGATION;
D O I
10.1109/LRA.2024.3482169
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present a novel approach for long-term human trajectory prediction in indoor human-centric environments, which is essential for long-horizon robot planning in these environments. State-of-the-art human trajectory prediction methods are limited by their focus on collision avoidance and short-term planning, and their inability to model complex interactions of humans with the environment. In contrast, our approach overcomes these limitations by predicting sequences of human interactions with the environment and using this information to guide trajectory predictions over a horizon of up to 60s . We leverage Large Language Models (LLMs) to predict interactions with the environment by conditioning the LLM prediction on rich contextual information about the scene. This information is given as a 3D Dynamic Scene Graph that encodes the geometry, semantics, and traversability of the environment into a hierarchical representation. We then ground these interaction sequences into multi-modal spatio-temporal distributions over human positions using a probabilistic approach based on continuous-time Markov Chains. To evaluate our approach, we introduce a new semi-synthetic dataset of long-term human trajectories in complex indoor environments, which also includes annotations of human-object interactions. We show in thorough experimental evaluations that our approach achieves a 54% lower average negative log-likelihood and a 26.5% lower Best-of-20 displacement error compared to the best non-privileged (i.e., evaluated in a zero-shot fashion on the dataset) baselines for a time horizon of 60 s .
引用
收藏
页码:10978 / 10985
页数:8
相关论文
共 50 条
  • [31] Using Articulated Scene Models for Dynamic 3D Scene Analysis in Vista Spaces
    Beuter, Niklas
    Swadzba, Agnes
    Kummert, Franz
    Wachsmuth, Sven
    3D RESEARCH, 2010, 1 (03): : 1 - 13
  • [32] High Density, Long-Term 3D PTV Using 3D Scanning Illumination and Telecentric Imaging
    Kitzhofer, Jens
    Kirmse, Clemens
    Bruecker, Christoph
    IMAGING MEASUREMENT METHODS FOR FLOW ANALYSIS: RESULTS OF THE DFG PRIORITY PROGRAMME 1147 - IMAGING MEASUREMENT METHODS FOR FLOW ANALYSIS 2003-2009, 2009, 106 : 125 - 134
  • [33] Long-Term Trajectory Prediction Model Based on Transformer
    Tong, Qiang
    Hu, Jinqing
    Chen, Yuli
    Guo, Dongdong
    Liu, Xiulei
    IEEE ACCESS, 2023, 11 : 143695 - 143703
  • [34] Introvert: Human Trajectory Prediction via Conditional 3D Attention
    Shafiee, Nasim
    Padir, Taskin
    Elhamifar, Ehsan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16810 - 16820
  • [35] Goal-driven Long-Term Trajectory Prediction
    Tran, Hung
    Le, Vuong
    Tran, Truyen
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 796 - 805
  • [36] A backward compatible 3D scene coding using residual prediction
    Shimizu, Shinya
    Kimata, Hideaki
    Kamikura, Kazuto
    Yashima, Yoshiyuki
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1141 - 1144
  • [37] Incomplete 3D Motion Trajectory Segmentation and 2D-to-3D Label Transfer for Dynamic Scene Analysis
    Jiang, Cansen
    Paudel, Danda Pani
    Fougerolle, Yohan
    Fofi, David
    Demonceaux, Cedric
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 606 - 613
  • [38] Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
    Wald, Johanna
    Dhamo, Helisa
    Navab, Nassir
    Tombari, Federico
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3960 - 3969
  • [39] 3D Mapping for a Reliable Long-Term Navigation
    Gines, Jonathan
    Martin, Francisco
    Matellan, Vicente
    Lera, Francisco J.
    Balsa, Jesus
    ROBOT 2017: THIRD IBERIAN ROBOTICS CONFERENCE, VOL 2, 2018, 694 : 283 - 294
  • [40] Long-term 3D epidermal organoid cultures
    不详
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2019, 139 (11) : 2250 - 2250