SCALE-Pose: Skeletal Correction and Language Knowledge-assisted for 3D Human Pose Estimation

Authors
Ma, Xinnan [1]
Li, Yaochen [1]
Zhao, Limeng [1]
Zhou, ChenXu [1]
Xu, Yuncheng [1]
Affiliations
[1] Xi'an Jiaotong University, School of Software Engineering, Xi'an 710049, People's Republic of China
Keywords
3D human pose estimation; Transformer; Priori knowledge; Skeletal correction; Large language model
DOI
10.1007/978-981-97-8795-1_39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transformer-based 3D human pose estimation methods typically take 2D joint sequences as input and use spatial and temporal transformer encoders to model the 3D human pose. However, these methods often neglect skeletal constraints that limit joint motion, and few integrate prior category knowledge to enhance potential joint representations. To address these problems, we propose a new method named SCALE-Pose. First, the method incorporates spatial and temporal skeleton correction blocks to better model long-range dependencies in the spatiotemporal motion of specific skeletons. Next, a four-stream radian loss based on skeleton angle error is introduced to constrain the motion space of joints. Finally, an auxiliary method employs global-local prompts from a large language model to generate prior category knowledge, improving the ability to generalize that knowledge. Experimental results on the Human3.6M and MPI-INF-3DHP datasets demonstrate that our method outperforms existing approaches.
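As a rough illustration of what a loss "based on skeleton angle error" might look like, the sketch below measures the mean angle, in radians, between predicted and ground-truth bone directions. It is not the paper's four-stream radian loss, whose definition the abstract does not give. The 5-joint chain `BONES` and the function names `bone_vectors` and `angle_error_loss` are hypothetical and chosen only for this example.

```python
import numpy as np

# Hypothetical 5-joint chain for illustration only: (parent, child) index pairs.
BONES = [(0, 1), (1, 2), (2, 3), (3, 4)]

def bone_vectors(joints, bones=BONES):
    """Return child-minus-parent bone vectors.

    joints: array of shape (..., J, 3); result has shape (..., B, 3).
    """
    parents = [p for p, _ in bones]
    children = [c for _, c in bones]
    return joints[..., children, :] - joints[..., parents, :]

def angle_error_loss(pred, target, bones=BONES, eps=1e-8):
    """Mean angle (radians) between predicted and target bone directions.

    This is an assumed, simplified single-stream formulation, not the
    loss defined in the paper.
    """
    bp = bone_vectors(pred, bones)
    bt = bone_vectors(target, bones)
    dot = np.sum(bp * bt, axis=-1)
    # Guard against zero-length bones without biasing the cosine.
    norms = np.maximum(np.linalg.norm(bp, axis=-1) * np.linalg.norm(bt, axis=-1), eps)
    cos = np.clip(dot / norms, -1.0, 1.0)
    return float(np.mean(np.arccos(cos)))

# Usage: a chain along the x-axis versus the same chain along the y-axis.
target = np.stack([np.arange(5.0), np.zeros(5), np.zeros(5)], axis=-1)
pred = np.stack([np.zeros(5), np.arange(5.0), np.zeros(5)], axis=-1)
loss = angle_error_loss(pred, target)  # every bone is off by a right angle
```

Because the loss depends only on bone directions, it penalizes implausible joint angles independently of bone length, which is one way an angle term can complement a position loss.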
Pages: 578-592
Number of pages: 15