MMDA: Multi-person marginal distribution awareness for monocular 3D pose estimation

被引:2
|
作者
Liu, Sheng [1 ]
Shuai, Jianghai [1 ]
Li, Yang [1 ]
Du, Sidan [1 ,2 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Jiangsu, Peoples R China
关键词
3D human pose estimation; bottom-up method; marginal distribution awareness; multi-person pose estimation;
D O I
10.1049/ipr2.12783
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing 3D pose representations cannot completely decouple the overlapping two or more human joints of the same type. In this paper, the authors propose a novel 2.5 D representation of the human pose by projecting human joints in 3D space onto the three orthogonal planes. The authors apply for the first time the permutation module to a multi-person 3D human pose estimation task and use Geometric Constraints Loss (GCL) to guide the learning of the model. The authors overcome the negative effects of the inductive bias of convolutional neural networks (CNNs) by aligning the intermediate feature space with the output feature space. The effectiveness of the authors' approach is validated on the carnegie mellon university (CMU) panoptic dataset and MuPoTS-3D dataset. The authors' proposed representations can effectively decouple the human joints in their selected data from overlapping human joints.
引用
收藏
页码:2182 / 2191
页数:10
相关论文
共 50 条
  • [1] Multi-person 3D Pose Estimation from Monocular Image Sequences
    Li, Ran
    Xu, Nayun
    Lu, Xutong
    Xing, Yucheng
    Zhao, Haohua
    Niu, Li
    Zhang, Liqing
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 15 - 24
  • [2] Multi-Person 3D Human Pose Estimation from Monocular Images
    Dabral, Rishabh
    Gundavarapu, Nitesh B.
    Mitra, Rahul
    Sharma, Abhishek
    Ramakrishnan, Ganesh
    Jain, Arjun
    [J]. 2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 405 - 414
  • [3] Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation
    Zhang, Juze
    Wang, Jingya
    Shi, Ye
    Gao, Fei
    Xu, Lan
    Yu, Jingyi
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1788 - 1796
  • [4] PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation
    Guo, Wen
    Corona, Enric
    Moreno-Noguer, Francesc
    Alameda-Pineda, Xavier
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2795 - 2805
  • [5] Monocular multi-person pose estimation: A survey
    dos Reis, Eduardo Souza
    Seewald, Lucas Adams
    Antunes, Rodolfo Stoffel
    Rodrigues, Vinicius Facco
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    da Silveira Jr, Luiz Gonzaga
    Eskofier, Bjoern
    Maier, Andreas
    Horz, Tim
    Fahrig, Rebecca
    [J]. PATTERN RECOGNITION, 2021, 118
  • [6] Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
    Cheng, Yu
    Wang, Bo
    Tan, Robby T. T.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1636 - 1651
  • [7] Monocular 3D multi-person pose estimation via predicting factorized correction factors
    Guo, Yu
    Ma, Lichen
    Li, Zhi
    Wang, Xuan
    Wang, Fei
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [8] Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
    Cheng, Yu
    Wang, Bo
    Yang, Bo
    Tan, Robby T.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1157 - 1165
  • [9] Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB
    Mehta, Dushyant
    Sotnychenko, Oleksandr
    Mueller, Franziska
    Xu, Weipeng
    Sridhar, Srinath
    Pons-Moll, Gerard
    Theobalt, Christian
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 120 - 130
  • [10] Multi-person 3D Pose Estimation and Tracking in Sports
    Bridgeman, Lewis
    Volino, Marco
    Guillemaut, Jean-Yves
    Hilton, Adrian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2487 - 2496