BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting

被引:2
|
作者
Mo, Shentong [1 ]
Xin, Miao [2 ]
机构
[1] Carnegie Mellon Univ, Elect & Comp Engn, Pittsburgh, PA 15213 USA
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Long-term forecasting; spatial-temporal graph transformer; Bayesian transformer; uncertainty estimation;
D O I
10.1109/TMM.2023.3269219
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human pose forecasting that aims to predict the body poses happening in the future is an important task in computer vision. However, long-term pose forecasting is particularly challenging because modeling long-range dependencies across the spatial-temporal level is hard for joint-based representation. Another challenge is uncertainty prediction since the future prediction is not a deterministic process. In this article, we present a novel <bold>B</bold>ayesian <bold>S</bold>patial-<bold>T</bold>emporal <bold>G</bold>raph <bold>Trans</bold>former (BSTG-Trans) for predicting accurate, diverse, and uncertain future poses. First, we apply a spatial-temporal graph transformer as an encoder and a temporal-spatial graph transformer as a decoder for modeling the long-range spatial-temporal dependencies across pose joints to generate the long-term future body poses. Furthermore, we propose a Bayesian sampling module for uncertainty quantization of diverse future poses. Finally, a novel uncertainty estimation metric, namely Uncertainty Absolute Error is introduced for measuring both the accuracy and uncertainty of each predicted future pose. We achieve state-of-the-art performance against other baselines on Human3.6 M and HumanEva-I in terms of accuracy, diversity, and uncertainty for long-term pose forecasting. Moreover, our comprehensive ablation studies demonstrate the effectiveness and generalization of each module proposed in our BSTG-Trans.
引用
收藏
页码:673 / 686
页数:14
相关论文
共 50 条
  • [31] GTformer: Graph-Based Temporal-Order-Aware Transformer for Long-Term Series Forecasting
    Liang, Aobo
    Chai, Xiaolin
    Sun, Yan
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19): : 31467 - 31478
  • [32] Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting
    Sun, Yiwen
    Wang, Yulu
    Fu, Kun
    Wang, Zheng
    Zhang, Changshui
    Ye, Jieping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3483 - 3490
  • [33] Multisize Patched Spatial-Temporal Transformer Network for Short-and Long-Term Crowd Flow Prediction
    Xie, Yulai
    Niu, Jingjing
    Zhang, Yang
    Ren, Fang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21548 - 21568
  • [34] Spatial-Temporal Synchronous Graph Convolutional Networks: A New Framework for Spatial-Temporal Network Data Forecasting
    Song, Chao
    Lin, Youfang
    Guo, Shengnan
    Wan, Huaiyu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 914 - 921
  • [35] TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking
    Chu, Peng
    Wang, Jiang
    You, Quanzeng
    Ling, Haibin
    Liu, Zicheng
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4859 - 4869
  • [36] Long-term correlation tracking via spatial-temporal context
    Chen, Zhi
    Liu, Peizhong
    Du, Yongzhao
    Luo, Yanmin
    Guo, Jing-Ming
    VISUAL COMPUTER, 2020, 36 (02): : 425 - 442
  • [37] GLOBALLY SPATIAL-TEMPORAL PERCEPTION: A LONG-TERM TRACKING SYSTEM
    Li, Zhenbang
    Wang, Qiang
    Gao, Jin
    Li, Bing
    Hu, Weiming
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2066 - 2070
  • [38] Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction
    Lin, Haoxing
    Bai, Rufan
    Jia, Weijia
    Yang, Xinyu
    You, Yongjian
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 36 - 46
  • [39] STGAFormer: Spatial-temporal Gated Attention Transformer based Graph Neural Network for traffic flow forecasting
    Geng, Zili
    Xu, Jie
    Wu, Rongsen
    Zhao, Changming
    Wang, Jin
    Li, Yunji
    Zhang, Chenlin
    INFORMATION FUSION, 2024, 105
  • [40] DTSFormer: Decoupled temporal-spatial diffusion transformer for enhanced long-term time series forecasting
    Zhu, Jiaming
    Liu, Dezhi
    Chen, Huayou
    Liu, Jinpei
    Tao, Zhifu
    KNOWLEDGE-BASED SYSTEMS, 2025, 309