Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

被引:17
|
作者
Gu, Renshu [1 ,3 ]
Wang, Gaoang [2 ]
Hwang, Jenq-Neng [3 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou 310018, Zhejiang, Peoples R China
[2] Zhejiang Univ Univ Illinois Urbana Champaign Inst, Haining 314400, Zhejiang, Peoples R China
[3] Univ Washington, Seattle, WA 98195 USA
关键词
D O I
10.1109/ICPR48806.2021.9412107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose estimation (HPE) is crucial in many fields, such as human behavior analysis, augmented reality/virtual reality (AR/VR) applications, and self-driving industry. Videos that contain multiple potentially occluded people captured from freely moving monocular cameras are very common in real-world scenarios, while 3D HPE for such scenarios is quite challenging, partially because there is a lack of such data with accurate 3D ground truth labels in existing datasets. In this paper, we propose a temporal regression network with a gated convolution module to transform 2D joints to 3D and recover the missing occluded joints in the meantime. A simple yet effective localization approach is further conducted to transform the normalized pose to the global trajectory. To verify the effectiveness of our approach, we also collect a new moving camera multi-human (MMHuman) dataset that includes multiple people with heavy occlusion captured by moving cameras. The 3D ground truth joints are provided by accurate motion capture (MoCap) system. From the experiments on static-camera based Human3.6M data and our own collected moving-camera based data, we show that our proposed method outperforms most state-of-the-art 2D-to-3D pose estimation methods, especially for the scenarios with heavy occlusions.
引用
收藏
页码:8243 / 8250
页数:8
相关论文
共 50 条
  • [1] Multi-Person 3D Pose Estimation With Occlusion Reasoning
    Chen, Xipeng
    Zhang, Junzheng
    Wang, Keze
    Wei, Pengxu
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 878 - 889
  • [2] Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
    Liu, Qihao
    Zhang, Yi
    Bai, Song
    Yuille, Alan
    [J]. COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 497 - 517
  • [3] Multi-person 3D Pose Estimation and Tracking in Sports
    Bridgeman, Lewis
    Volino, Marco
    Guillemaut, Jean-Yves
    Hilton, Adrian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2487 - 2496
  • [4] AnimePose: Multi-person 3D pose estimation and animation
    Kumarapu, Laxman
    Mukherjee, Prerana
    [J]. PATTERN RECOGNITION LETTERS, 2021, 147 : 16 - 24
  • [5] Multi-person pose estimation using atrous convolution
    Lin, Lu
    Wang, Yifei
    Wang, Lianghao
    Zhang, Ming
    Li, Dongxiao
    [J]. ELECTRONICS LETTERS, 2019, 55 (09) : 533 - +
  • [6] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
    Gu, Renshu
    Wang, Gaoang
    Jiang, Zhongyu
    Hwang, Jenq-Neng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
  • [7] Direct Multi-view Multi-person 3D Pose Estimation
    Wang, Tao
    Zhang, Jianfeng
    Cai, Yujun
    Yan, Shuicheng
    Feng, Jiashi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
    Qiu, Zhongwei
    Yang, Qiansheng
    Wang, Jian
    Fu, Dongmei
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
  • [9] Multi-person 3D pose estimation from unlabelled data
    Daniel Rodriguez-Criado
    Pilar Bachiller-Burgos
    George Vogiatzis
    Luis J. Manso
    [J]. Machine Vision and Applications, 2024, 35
  • [10] Multi-person 3D pose estimation from unlabelled data
    Rodriguez-Criado, Daniel
    Bachiller-Burgos, Pilar
    Vogiatzis, George
    Manso, Luis J.
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (03)