Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

被引:17
|
作者
Gu, Renshu [1 ,3 ]
Wang, Gaoang [2 ]
Hwang, Jenq-Neng [3 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou 310018, Zhejiang, Peoples R China
[2] Zhejiang Univ Univ Illinois Urbana Champaign Inst, Haining 314400, Zhejiang, Peoples R China
[3] Univ Washington, Seattle, WA 98195 USA
关键词
D O I
10.1109/ICPR48806.2021.9412107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose estimation (HPE) is crucial in many fields, such as human behavior analysis, augmented reality/virtual reality (AR/VR) applications, and self-driving industry. Videos that contain multiple potentially occluded people captured from freely moving monocular cameras are very common in real-world scenarios, while 3D HPE for such scenarios is quite challenging, partially because there is a lack of such data with accurate 3D ground truth labels in existing datasets. In this paper, we propose a temporal regression network with a gated convolution module to transform 2D joints to 3D and recover the missing occluded joints in the meantime. A simple yet effective localization approach is further conducted to transform the normalized pose to the global trajectory. To verify the effectiveness of our approach, we also collect a new moving camera multi-human (MMHuman) dataset that includes multiple people with heavy occlusion captured by moving cameras. The 3D ground truth joints are provided by accurate motion capture (MoCap) system. From the experiments on static-camera based Human3.6M data and our own collected moving-camera based data, we show that our proposed method outperforms most state-of-the-art 2D-to-3D pose estimation methods, especially for the scenarios with heavy occlusions.
引用
收藏
页码:8243 / 8250
页数:8
相关论文
共 50 条
  • [31] Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation
    Jin, Lei
    Xu, Chenyang
    Wang, Xiaojuan
    Xiao, Yabo
    Guo, Yandong
    Nie, Xuecheng
    Zhao, Jian
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13076 - 13085
  • [32] Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes
    Renshu Gu
    Zhongyu Jiang
    Gaoang Wang
    Kevin McQuade
    Jenq-Neng Hwang
    [J]. Multimedia Tools and Applications, 2022, 81 : 32883 - 32906
  • [33] Multi-person Absolute 3D Human Pose Estimation with Weak Depth Supervision
    Veges, Marton
    Lorincz, Andras
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 258 - 270
  • [34] Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement
    Cha, Junuk
    Saqlain, Muhammad
    Kim, GeonU
    Shin, Mingyu
    Baek, Seungryul
    [J]. COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 660 - 677
  • [35] Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes
    Gu, Renshu
    Jiang, Zhongyu
    Wang, Gaoang
    McQuade, Kevin
    Hwang, Jenq-Neng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 32883 - 32906
  • [36] Unsupervised Multi-view Multi-person 3D Pose Estimation Using Reprojection Error
    de Franca Silva, Diogenes Wallis
    Do Monte Lima, Joao Paulo Silva
    Macedo, David
    Zanchettin, Cleber
    Thomas, Diego Gabriel Francis
    Uchiyama, Hideaki
    Teichrieb, Veronica
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 482 - 494
  • [37] Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
    Cheng, Yu
    Wang, Bo
    Tan, Robby T. T.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1636 - 1651
  • [38] Monocular 3D multi-person pose estimation via predicting factorized correction factors
    Guo, Yu
    Ma, Lichen
    Li, Zhi
    Wang, Xuan
    Wang, Fei
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [39] Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
    Cheng, Yu
    Wang, Bo
    Yang, Bo
    Tan, Robby T.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1157 - 1165
  • [40] A practical framework of multi-person 3D human pose estimation with a single RGB camera
    Ma, Le
    Lian, Sen
    Wang, Shandong
    Meng, Weiliang
    Xiao, Jun
    Zhang, Xiaopeng
    [J]. 2021 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES ABSTRACTS AND WORKSHOPS (VRW 2021), 2021, : 420 - 421