SCALE-Pose: Skeletal Correction and Language Knowledge-assisted for 3D Human Pose Estimation

Cited by: 0
Authors
Ma, Xinnan [1]
Li, Yaochen [1]
Zhao, Limeng [1]
Zhou, ChenXu [1]
Xu, Yuncheng [1]
Affiliation
[1] Xi'an Jiaotong University, School of Software Engineering, Xi'an 710049, People's Republic of China
Keywords
3D human pose estimation; Transformer; Priori knowledge; Skeletal correction; Large language model;
DOI
10.1007/978-981-97-8795-1_39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transformer-based 3D human pose estimation methods typically take 2D joint sequences as input and use spatial and temporal transformer encoders to model the 3D human pose. However, these methods often fail to incorporate skeletal constraints that limit joint motion, and few integrate prior category knowledge to enhance potential joint representations. To address these problems, we propose a new method named SCALE-Pose. First, the method incorporates spatial and temporal skeleton correction blocks to better model long-range dependencies in the spatiotemporal motion of specific skeletons. Next, a four-stream radian loss based on skeleton angle error is introduced to constrain the motion space of joints. Finally, an auxiliary method uses global-local prompts from a large language model to generate prior category knowledge, which improves the generalization of that knowledge. Experimental results on the Human3.6M and MPI-INF-3DHP datasets demonstrate that our method outperforms existing approaches.
Pages: 578-592
Page count: 15