Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes

被引:0
|
作者
Gu, Renshu [1 ]
Jiang, Zhongyu [2 ]
Wang, Gaoang [3 ]
McQuade, Kevin [4 ]
Hwang, Jenq-Neng [2 ]
机构
[1] Hangzhou Dianzi Univ, Comp & Software Sch, Hangzhou, Zhejiang, Peoples R China
[2] Univ Washington, Elect & Comp Engn, Seattle, WA 98195 USA
[3] Zhejiang Univ, Univ Illinois, Urbana Champaign Inst, Hailing, Zhejiang, Peoples R China
[4] Univ Washington, Sch Med, Seattle, WA USA
基金
中国国家自然科学基金;
关键词
3D human pose estimation; Monocular camera; Moving camera; Hierarchical human pose estimation; Motion capture;
D O I
10.1007/s11042-022-13079-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-person 3D pose estimation using a monocular freely moving camera in real-world scenarios remains a challenge. There is a lack of data with 3D ground truth, and real-world scenes usually contain self-occlusions and inter-person occlusions. To address these challenges, an unsupervised Universal Hierarchical 3D Human Pose Estimation (UH3DHPE) method that optimizes the torso and limb poses based on a hierarchical framework is proposed. To handle the case of an occluded or inaccurate 2D torso keypoints, which play an important role for 3D pose initialization and subsequent inference, an effective method to directly estimate limb poses without building upon the estimated torso pose is proposed, and the torso pose can then be further refined to form the hierarchy in a bottom-up fashion. An adaptive merging strategy is proposed to determine the best hierarchy. To verify the effectiveness of the proposed scheme, a video dataset for multi-person interactions is collected by a moving camera, under a Motion Capture (MoCap) ground truth data acquisition environment, for our performance evaluations. Experimental results show the proposed method outperforms state-of-the-art methods on the multi-person moving camera scenarios.
引用
收藏
页码:32883 / 32906
页数:24
相关论文
共 50 条
  • [31] Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation
    Jin, Lei
    Xu, Chenyang
    Wang, Xiaojuan
    Xiao, Yabo
    Guo, Yandong
    Nie, Xuecheng
    Zhao, Jian
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13076 - 13085
  • [32] Multi-person Absolute 3D Human Pose Estimation with Weak Depth Supervision
    Veges, Marton
    Lorincz, Andras
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 258 - 270
  • [33] Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement
    Cha, Junuk
    Saqlain, Muhammad
    Kim, GeonU
    Shin, Mingyu
    Baek, Seungryul
    [J]. COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 660 - 677
  • [34] Enhanced 3D Pose Estimation in Multi-Person, Multi-View Scenarios through Unsupervised Domain Adaptation with Dropout Discriminator
    Deng, Junli
    Yao, Haoyuan
    Shi, Ping
    [J]. SENSORS, 2023, 23 (20)
  • [35] Hierarchical pose net: spatial hierarchical body tree driven multi-person pose estimation
    Li, Haoran
    Yao, Hongxun
    Hou, Yuxin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 6373 - 6392
  • [36] Hierarchical pose net: spatial hierarchical body tree driven multi-person pose estimation
    Haoran Li
    Hongxun Yao
    Yuxin Hou
    [J]. Multimedia Tools and Applications, 2024, 83 : 6373 - 6392
  • [37] Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
    Cheng, Yu
    Wang, Bo
    Tan, Robby T. T.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1636 - 1651
  • [38] Monocular 3D multi-person pose estimation via predicting factorized correction factors
    Guo, Yu
    Ma, Lichen
    Li, Zhi
    Wang, Xuan
    Wang, Fei
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [39] Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
    Cheng, Yu
    Wang, Bo
    Yang, Bo
    Tan, Robby T.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1157 - 1165
  • [40] A practical framework of multi-person 3D human pose estimation with a single RGB camera
    Ma, Le
    Lian, Sen
    Wang, Shandong
    Meng, Weiliang
    Xiao, Jun
    Zhang, Xiaopeng
    [J]. 2021 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2021), 2021, : 420 - 421