Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations

被引:0
|
作者
Shen, Tianyu [1 ]
Li, Deqi [1 ]
Wang, Fei-Yue [2 ,3 ]
Huang, Hua [1 ]
机构
[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
[2] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
关键词
Three-dimensional displays; Pose estimation; Feature extraction; Location awareness; Cameras; Semantics; Solid modeling; Human depth perceiving; multi-person 3d pose estimation; multi-scale representation; occlusion handling;
D O I
10.1109/TMM.2022.3233251
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating absolute 3D poses of multiple people from monocular image is challenging due to the presence of occlusions and the scale variation among different persons. Among the existing methods, the top-down paradigms are highly dependent on human detection which is prone to the influence from inter-person occlusions, while the bottom-up paradigms suffer from the difficulties in keypoint feature extraction caused by scale variation and unreliable joint grouping caused by occlusions. To address these challenges, we introduce a novel multi-person 3D pose estimation framework, aided by multi-scale feature representations and human depth perceiving. Firstly, a waterfall-based architecture is incorporated for multi-scale feature representations to achieve a more accurate estimation of occluded joints with a better detection of human shapes. Then the global and local representations are fused for handling the effects of inter-person occlusion and scale variation in depth perceiving and keypoint feature extraction. Finally, with the guidance of the fused multi-scale representations, a depth-aware model is exploited for better 2D joint grouping and 3D pose recovering. Quantitative and qualitative evaluations on benchmark datasets of MuCo-3DHP and MuPoTS-3D prove the effectiveness of our proposed method. Furthermore, we produce an occluded MuPoTS-3D dataset and the experiments on it validate the superiority of our method for overcoming the occlusions.
引用
收藏
页码:1439 / 1451
页数:13
相关论文
共 50 条
  • [1] Multi-person 3D Pose Estimation and Tracking in Sports
    Bridgeman, Lewis
    Volino, Marco
    Guillemaut, Jean-Yves
    Hilton, Adrian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2487 - 2496
  • [2] Multi-person Absolute 3D Human Pose Estimation with Weak Depth Supervision
    Veges, Marton
    Lorincz, Andras
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 258 - 270
  • [3] Multi-Person 3D Pose Estimation With Occlusion Reasoning
    Chen, Xipeng
    Zhang, Junzheng
    Wang, Keze
    Wei, Pengxu
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 878 - 889
  • [4] AnimePose: Multi-person 3D pose estimation and animation
    Kumarapu, Laxman
    Mukherjee, Prerana
    [J]. PATTERN RECOGNITION LETTERS, 2021, 147 : 16 - 24
  • [5] Direct Multi-view Multi-person 3D Pose Estimation
    Wang, Tao
    Zhang, Jianfeng
    Cai, Yujun
    Yan, Shuicheng
    Feng, Jiashi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
    Gu, Renshu
    Wang, Gaoang
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
  • [7] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
    Qiu, Zhongwei
    Yang, Qiansheng
    Wang, Jian
    Fu, Dongmei
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
  • [8] Multi-person 3D pose estimation from unlabelled data
    Daniel Rodriguez-Criado
    Pilar Bachiller-Burgos
    George Vogiatzis
    Luis J. Manso
    [J]. Machine Vision and Applications, 2024, 35
  • [9] Multi-person 3D pose estimation from unlabelled data
    Rodriguez-Criado, Daniel
    Bachiller-Burgos, Pilar
    Vogiatzis, George
    Manso, Luis J.
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [10] Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
    Fabbri, Matteo
    Lanzi, Fabio
    Calderara, Simone
    Alletto, Stefano
    Cucchiara, Rita
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7202 - 7211