BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting

Cited by: 2
Authors
Mo, Shentong [1]
Xin, Miao [2]
Affiliations
[1] Carnegie Mellon Univ, Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Long-term forecasting; spatial-temporal graph transformer; Bayesian transformer; uncertainty estimation
DOI
10.1109/TMM.2023.3269219
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Human pose forecasting, which aims to predict future body poses, is an important task in computer vision. However, long-term pose forecasting is particularly challenging because modeling long-range dependencies across the spatial-temporal level is hard for joint-based representations. Another challenge is uncertainty prediction, since future prediction is not a deterministic process. In this article, we present a novel Bayesian Spatial-Temporal Graph Transformer (BSTG-Trans) for predicting accurate, diverse, and uncertain future poses. First, we apply a spatial-temporal graph transformer as an encoder and a temporal-spatial graph transformer as a decoder for modeling the long-range spatial-temporal dependencies across pose joints to generate the long-term future body poses. Furthermore, we propose a Bayesian sampling module for uncertainty quantization of diverse future poses. Finally, a novel uncertainty estimation metric, namely Uncertainty Absolute Error, is introduced for measuring both the accuracy and uncertainty of each predicted future pose. We achieve state-of-the-art performance against other baselines on Human3.6M and HumanEva-I in terms of accuracy, diversity, and uncertainty for long-term pose forecasting. Moreover, our comprehensive ablation studies demonstrate the effectiveness and generalization of each module proposed in our BSTG-Trans.
Pages: 673-686
Number of pages: 14