Dynamic Graph Reasoning for Multi-person 3D Pose Estimation

Cited by: 5
Authors
Qiu, Zhongwei [1]
Yang, Qiansheng [2]
Wang, Jian [2]
Fu, Dongmei [1]
Affiliations
[1] Univ Sci & Technol Beijing, Beijing, Peoples R China
[2] Baidu, Beijing, Peoples R China
Keywords
Human Pose Estimation; Multi-person; Graph Reasoning
DOI
10.1145/3503161.3547846
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Multi-person 3D pose estimation is a challenging task because of occlusion and depth ambiguity, especially in crowded scenes. To address these problems, most existing methods model body context cues by enhancing feature representations with graph neural networks or by adding structural constraints. However, these methods are not robust because of their single-root formulation, which decodes 3D poses from a root node with a pre-defined graph. In this paper, we propose GR-M3D, which models Multi-person 3D pose estimation with dynamic Graph Reasoning. The decoding graph in GR-M3D is predicted rather than pre-defined. Specifically, GR-M3D first generates several data maps and enhances them with a scale and depth aware refinement module (SDAR). Multiple root keypoints and dense decoding paths for each person are then estimated from these data maps. From these, dynamic decoding graphs are built by assigning path weights to the decoding paths, where the path weights are inferred from the enhanced data maps. This process is called dynamic graph reasoning (DGR). Finally, the 3D pose of each detected person is decoded according to its dynamic decoding graph. By adopting soft path weights that depend on the input data, GR-M3D implicitly adjusts the structure of the decoding graph, which makes the decoding graphs adaptive to different input persons and more capable of handling occlusion and depth ambiguity than previous methods. We empirically show that the proposed bottom-up approach even outperforms top-down methods and achieves state-of-the-art results on three 3D pose datasets.
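The soft path-weight idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that each root keypoint proposes a candidate position for a joint (root plus a predicted offset along its decoding path) and that the candidates are blended with softmax-normalized path scores. The names `softmax`, `decode_joint`, `roots`, `offsets`, and `path_scores` are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw path scores into soft weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def decode_joint(
    roots: Sequence[Vec3],
    offsets: Sequence[Vec3],
    path_scores: Sequence[float],
) -> Vec3:
    """Blend per-root joint candidates using soft path weights.

    Assumption (not from the paper): candidate k is roots[k] + offsets[k],
    and the decoded joint is the weighted sum of all candidates.
    """
    if not (len(roots) == len(offsets) == len(path_scores)) or not roots:
        raise ValueError("roots, offsets and path_scores must be non-empty and equal in length")
    weights = softmax(path_scores)
    candidates = [
        tuple(r + o for r, o in zip(root, off))
        for root, off in zip(roots, offsets)
    ]
    return tuple(
        sum(w * c[axis] for w, c in zip(weights, candidates))
        for axis in range(3)
    )


# Usage: two roots propose positions for the same joint.
joint = decode_joint(
    roots=[(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)],
    offsets=[(1.0, 1.0, 1.0), (1.0, 1.0, 1.0)],
    path_scores=[0.0, 0.0],
)
```

With equal scores the result is the mean of the candidates. If one path receives a much higher score (for example, because the other root is occluded), the decoded joint moves toward that path's candidate, which is the sense in which the graph structure is adjusted implicitly in this sketch.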
Pages: 3521-3529
Number of pages: 9