BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting

Cited by: 2
Authors
Mo, Shentong [1]
Xin, Miao [2]
Affiliations
[1] Carnegie Mellon Univ, Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Long-term forecasting; spatial-temporal graph transformer; Bayesian transformer; uncertainty estimation
DOI
10.1109/TMM.2023.3269219
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Human pose forecasting, which aims to predict future body poses, is an important task in computer vision. However, long-term pose forecasting is particularly challenging because modeling long-range spatial-temporal dependencies is hard for joint-based representations. Another challenge is uncertainty prediction, since predicting the future is not a deterministic process. In this article, we present a novel Bayesian Spatial-Temporal Graph Transformer (BSTG-Trans) for predicting accurate, diverse, and uncertain future poses. First, we apply a spatial-temporal graph transformer as an encoder and a temporal-spatial graph transformer as a decoder to model the long-range spatial-temporal dependencies across pose joints and generate long-term future body poses. Furthermore, we propose a Bayesian sampling module for uncertainty quantification of diverse future poses. Finally, a novel uncertainty estimation metric, namely Uncertainty Absolute Error, is introduced for measuring both the accuracy and the uncertainty of each predicted future pose. We achieve state-of-the-art performance against other baselines on Human3.6M and HumanEva-I in terms of accuracy, diversity, and uncertainty for long-term pose forecasting. Moreover, our comprehensive ablation studies demonstrate the effectiveness and generalization of each module proposed in BSTG-Trans.
Pages: 673-686
Page count: 14