DTG: Learning A Dynamic Token Graph for 3D Pose Forecasting

被引:0
|
作者
He, Yangliu [1 ]
Deng, Haoge [1 ]
Shen, Qiwei [1 ]
Liao, Jianxin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
3D Pose Forecasting; Tokens Representation; Graph Network;
D O I
10.1007/978-3-031-72338-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D pose forecasting aims to predict future pose sequences based on historical poses, which has a wide range of practical applications. Previous methods mainly focus on representing body joints as 3D coordinates, yet ignoring the dependency modeling between joints. Furthermore, human forecasting is unstable when variations in time and pose type are considered. Therefore, we propose a Dynamic Token Graph Network (DTG) for 3D pose forecasting. First, to model the dependency between the body joints effectively, we represent joints by the composition of discrete tokens (see Fig. 1(b)) to replace 3D coordinates. Second, we design a novel dynamic graph neural network architecture to characterize the correlations of joints as time and pose type changes (e.g. Sitting and Walking). Comprehensive experiments on Human 3.6M, AMASS, and 3DPW datasets confirm the superiority of our method, which is applicable to both angle-based and coordinate-based pose representations.
引用
收藏
页码:118 / 129
页数:12
相关论文
共 50 条
  • [1] DGFormer: Dynamic graph transformer for 3D human pose estimation
    Chen, Zhangmeng
    Dai, Ju
    Bai, Junxuan
    Pan, Junjun
    PATTERN RECOGNITION, 2024, 152
  • [2] Dynamic Graph CNN with Attention Module for 3D Hand Pose Estimation
    Jiang, Xu
    Ma, Xiaohong
    ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I, 2019, 11554 : 87 - 96
  • [3] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
    Qiu, Zhongwei
    Yang, Qiansheng
    Wang, Jian
    Fu, Dongmei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
  • [4] Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation
    Zeng, Ailing
    Sun, Xiao
    Yang, Lei
    Zhao, Nanxuan
    Liu, Minhao
    Xu, Qiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11416 - 11425
  • [5] Animal Pose Tracking: 3D Multimodal Dataset and Token-based Pose Optimization
    Mahir Patel
    Yiwen Gu
    Lucas C. Carstensen
    Michael E. Hasselmo
    Margrit Betke
    International Journal of Computer Vision, 2023, 131 : 514 - 530
  • [6] Animal Pose Tracking: 3D Multimodal Dataset and Token-based Pose Optimization
    Patel, Mahir
    Gu, Yiwen
    Carstensen, Lucas C.
    Hasselmo, Michael E.
    Betke, Margrit
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 514 - 530
  • [7] Generative 3D Part Assembly via Dynamic Graph Learning
    Huang, Jialei
    Zhan, Guanqi
    Fan, Qingnan
    Mo, Kaichun
    Shao, Lin
    Chen, Baoquan
    Guibas, Leonidas
    Dong, Hao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] POSECUT: Simultaneous segmentation and 3D pose estimation of humans using dynamic graph-cuts
    Bray, Matthieu
    Kohli, Pushmeet
    Torr, Philip H. S.
    COMPUTER VISION - ECCV 2006, PT 2, PROCEEDINGS, 2006, 3952 : 642 - 655
  • [9] Learning to Refine 3D Human Pose Sequences
    Mei, Jieru
    Chen, Xingyu
    Wang, Chunyu
    Yuille, Alan
    Lan, Xuguang
    Zeng, Wenjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 358 - 366
  • [10] GARNet: Graph Attention Residual Networks Based on Adversarial Learning for 3D Human Pose Estimation
    Chen, Zhihua
    Liu, Xiaoli
    Sheng, Bing
    Li, Ping
    ADVANCES IN COMPUTER GRAPHICS, CGI 2020, 2020, 12221 : 276 - 287